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INTRODUCTION 


This report summarizes the awards made by the National 
Science Foundation in Fiscal Years 1978-1979-1980 on 
recommendation by the Division of Information Science 
and Technology. 


The report is divided into three major sections. The 
first section presents a summary table of awards and is 
arranged alphabetically by principal investigator. The second 
section, containing descriptions of the work funded, is 


arranged alphabetically by institutions performing the 
work. The third section contains the subject index. 

Requests for additional information about specific 
projects should be addressed to principal investigators. 
General information on the current NSF information sci- 
ence research program and award procedures may be 
obtained from the Division of Information Science and 
Technology, National Science Foundation, Washington, 
DC 20550. 


DEFINITIONS 


The following definitions are essential to an understanding 
of this report: 


e *‘Award” refers to financial support given through a 
grant, contract, interagency agreement, or other 
arrangement, depending upon the nature of the work 
to be completed, the terms of performance, and the 
institution that performs the work. 


e “Amendment” refers to an award which adds funds 
to an existing award with or without increasing the 
duration of NSF support. 

¢ ‘Principal Investigator” refers to the scientist or 


administrator who is responsible for directing an NSF 
supported project. 


e “Institution” refers to any organization which has 
accepted an award from NSF. 


Awards excluded from this report are: 
e Purchase Orders 


e Personnel Services 


The award data presented consist of program grants 
and contracts made from the Salaries and Expenses appro- 
priations to the Foundation for Fiscal Years 1978-1980. 
Information contained in the report was obtained from 
program sources and may differ from official NSF records 
which may have different inclusions and exclusions. 


FY 1978 AWARDS 
DIVISION OF INFORMATION SCIENCE AND TECHNOLOGY 


Award 
Principal Award Date/ Summary 
Investigator(s) Institution Title No. Amount Duration No. 
(Month) 
Beresford, John Data Use and Conference on a National- $OC77-20334 $ 68,300 12/12/77 034 
Access Labs, Inc. _ Level Program of Access to 12 
Machine-Readable Data Files 
Bernard, Russell West Virginia Accuracy of Data on IST78-12801 $ 39,403 08/14/78 156 
Univ. Information Exchange by 12 
Electronic Network 
Bommer, Michael Clarkson College Analysis of the Operational IST78-12093 $ 76,152 08/30/78 026 
Chorba, Ronald of Technology Parameters for the Management 12 
Caruso, Elaine U/Pittsburgh Training Modules for Use of IST77-26525 $ 67,875 02/23/79 125 
Scientific & Technical 12 
Information Services Phase II 
Copeland, Richard Bethune-Cookman Measurement and Improve- IST78-11407 $ 89,535 06/13/78 007 
College ment of On-Line Chemical i8 
Searching Performance in Three 
User Groups at Small Colleges 
Coward, H. Roberts Inst. for Scientific Research Needs and Applica- IST78-18730 $ 18,142 09/11/78 063 
Information tions for Indicators Based on 6 
the Scientific and Technical 
Literature 
Debons, Anthony U / Pittsburgh A Survey of Manpower IST77-27115 $101,417 06/26/78 126 
Requirements for Scientific & 7 
Technical Communication 
Dunn, Donald Stanford Univ. Economics of Computer MCS78-02272 $150,000 08 /03/78 140 
Communication Networks 12 
Emery, James EDUCOM Factors Influencing Network MCS78-07383 $362,269 09/30/78 037 
Sharing of Computer Resources 20 
in Research 
Goffman, William Case Western Theoretical and Experimental IST 78-06292 $ 71,588 04/12/78 024 
Reserve Univ. Investigation of the Dynamics 24 
of Information 
Hall, Homer Rutgers Univ. Services in Industry for the IST78-16628 $ 49,624 08/30/78 138 
Analysis and Evaluation of 24 
Information 
Harris, Virginia Oberlin College Research on the Impact of a IST78-10821 $139,980 09/21/78 114 
Computerized Circulation 24 
System on the Performance of 
a Large College Library 
Harris, Zellig U/Pennsylvania Correlation of Language IST78-03863 $163,329 04/10/78 120 
Structure with Information 24 
Hessman, John Electronic Indus- _—_ Electronic Information IST 78-09189 $ 77,113 07/24/78 038 
tries Association Exchange in Microcomputer 18 
Component Standardization 
Research 
Isenhour, Thomas U/North Carolina Computer Applications to CHE78-00632 $ 71,000 05/15/78 110 


Analytical Chemistry 


12 


FY 1978 AWARDS—CONTINUED 
DIVISION OF INFORMATION SCIENCE AND TECHNOLOGY 


. Award 
Principal Award Date/ Summary 
Investigator(s) Institution Title No. Amount Duration No. 
(Month) 
Kent, Allen U/Pittsburgh The Economics of Information IST77-17635 $ 84,900 01/10/78 130 
Transfer Using Resource 18 
Sharing Networks-Network 
Functions 
King, Donald King Research, Development of Statistical IST78-04789 $ 9,281 11/15/77 068 
Inc. Indicators of Scientific and 0 
Technical Communication 
King, Donald King Research, Data Gathering for Statistical IST77-17915 $147,442 11/22/77 069 
Inc. Indicators of Scientific and 12 
Technical Communication 
from Primary Sources 
King, Donald King Research, Systems Analysis of Scientific IST77-28438 $ 11,394 12/08/77 070 
Inc. and Technical Communication 6 
in the United States 
King, Donald King Research, Evaluation of Information IST78-21131 $ 30,000 09 30/78 071 
Inc. Science Research 6 
Kochen, Manfred U/ Michigan Links of Interest and Expertise IST78-16629 5 49.621 08 14/78 091 
Among Scientists 12 
Lancaster, F. W. U / Illinois Impact of a Paperless Society IST78-04768 $ 50,540 06/11/78 051 
on the Research Library of the 12 
Future 
Levitan, Karen Mitre Corporation Systematic Process Evaluation IST78-21078 $ 29.847 06/11/78 OS! 
of Fundamental Information 6 
Science Research | 
Machlup, Fritz New York Univ. An In-Depth Analysis of a IST78-13383 $160,178 07/10/78 104 
Comprehensive Data Base on 12 | 
the Communication of Scien- | 
tific /Scholarly Knowledge 
Martin, Thomas U /Southern Design for Evaluating the IST78-21130 $ 28,589 08 23/78 139 | 
Danowski, James California Centrality of Basic Information 6 
Science Research to Research 
Users in Organizations 
Meadow, Charles Drexel Univ. The Design & Assessment of IST77-26524 $229,600 02/13/78 036 | 
Individualized Instruction in 24 
Bibliographic Data Base Access 
Mick, Colin Applied Commu- Development & Testing of a IST78-10532 $103,350 02/13/78 003 
nication Research Methodology to Assess Private 2 | 
Investment for Scientific and | 
Technical Information | 
Mick, Colin Applied Commu- Assessing the Information IST78-10531 $ 69,309 06 29 78 004 
nication Research Behavior of Scientists & 12 
Engineers-A Secondary Data 
Analysis 
McGill, Michael Syracuse Univ. An Evaluation of Factors IST78-10454 $ 67.857 05 09 78 148 
Affecting Document Ranking 12 
by Information Retrieval 
Systems 


Nd 


DIVISION OF INFORMATION SCIENCE AND TECHNOLOGY 


FY 1978 AWARDS—CONTINUED 


Award 
Principal Award Date Summary 
Investigator(s) Institution Title No. Amount Duration No. 
{Month} 
Niehoff, Robert Battelle Memorial Extension, Refinement and IST77-04498 $106,186 05 26 78 006 
Inst. Evaluation of On-Line Auto- 12 
mated Subject Switching 
Nisenoff, Norman Forecasting A Study of the Potentia! Influ- IST78-12102 $178,775 09 13°78 O42 
International ence of Social, Economic, 24 
Regulatory & Technologica! 
Factors on Scientific & 
Technical Communication 
Through 2000 A.D. 
O'Connor, John Lehigh Univ. Uses of Citing Statements to IST78-01054 $116,037 07/05/78 073 
Improve Retrieval 24 
Olson, Edwin NTL Inst. for Development of a Conceptual IST78-19923 $ 49.61! 09,07 78 112 
Lake, Dale Applied Behavioral Framework to Guide Research 12 
Science in Information Science 
Paisley, William Stanford Univ. Developing a Mode! and IST 77-28242 $ 65,000 OY 21,78 144 
Methodology for Evaluating 12 
Practitioner Information 
Sysiems: The Case of Pacfornet 
Riley, James Library of Evaluation of the Use of Slow- IST78-12101 $145,905 OS 23 78 076 
Congress Scan jclevision and Tele- 24 
fecsimiic ina Library Network 
Rummer, Dale U Kansas improving the Use of IST77-23193 $ 6).42! O01 31,78 067 
Computer-Based Information 1§ 
Resources Within the 
Engineering Classroom 
Sager, Naomi New York Univ. Research iato Methods for 1IST77-24530 $211,000 12/29/77 107 
Auioisestic Classification & 24 
Fact Reirievel in Science 
Subtields 
Sager, Naomi New York Univ. Data Base Management IST78-1 3939 $188,943 09 07 78 108 
Systems for Naiural Language 24 
Data Bases 
Salton, Gerard Cornell Univ. Research on Combining Docv- 1ST77-04843 $133,555 11 18 78 030 
ment and Data Retrieval 24 
Technologies 
Saltzer, Benjamin Naval Ocean Study of Impacts of a formal IST77-23192 $ 71.89! 05 23,78 099 
Systems Center Computer-Based Informaton 18 
System on Information 
Networks 
Schipma, Peter IIT Research Video Full Text Processing IST78-11381) $222.570 O08 23 78 049 
Institute 18 
Sheridan, Thomas Massachusetts Electronic Information IS7 78-06749 $148,454 07 19 78 ORS 
Institute of Exchange in Mental Workiead 18 
Technology Research 
Shuchman, Hedvah The Futures The Effects of STI Changes IST77-20839 $129,927 06 13 78 O44 
Group on the Engineering Profession 1S 
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DIVISION OF INFORMATION SCIENCE AND TECHNOLOGY 


FY 1978 AWARDS—CONTINUED 


Award 
Principal Award Date Summary 
Investigator(s) Institution Title No. Amount Duration No 
{Month} 
Small, Henry Inst. for Scientific The Structure & Development IST78-16677 $ 32,689 O08 23,78 062 
Pinski, Gabriel Information of the Information Science 12 
Literature: A Citation Study 
Stevens, C. H. Participation Electronic Information IST77-27940 $150,575 07 24 78 119 
Systems Exchange in Legislative 18 
Technical Research 
Turoff, Murray New Jersey Insti- Electronic Information IST77-2 1008 $275,197 O8 03 78 101 
tute of Technology Exchange Test Facility 18 
Waltz, David U_ Illinois Theoretical Issues in Natural iS 1T78-18228 $ 14.242 06 26 78 O52 
Language Processing-2: An 6 
Interdisciplinary Conference 
White, Herbert Indiana Univ. The Economics of Library & IST78-09182 $ 61,025 0613.78 059 
Subscriptions: A Sample Survey 12 
Wilcox, Alice Minnesota Higher A Conference on Serial Collec- IST78-18103 $ 32,469 09°29 78 093 
Educ. Coordinat- —_ tions in a Network Environment 12 
ing Board 
Williams, Martha U_Illinots An Integrated Man, Machine IST77-26886 $174,432 02 14.78 OSS 
Interface to Facilitate Network 18 
Resource Utilization 
Zinn, Karl U_ Michigan Computer-Based System for IST7&-03962 $ 20,000 OS 0S 78 = ORO 
Sharing Technical Information 12 


FY 1979 AWARDS 
DIVISION OF INFORMATION SCIENCE AND TECHNOLOGY 


Award 
Principal Award Date) Summary 
Investigator(s} Institution Title No. Amount Duration No. 
{Month} 
Agrawal, Dharma Wayne State Univ. Subgraph Identification Using 1ST79-18462 $ 61.470 07 30 79 154 
Associative Techniques with 24 
Applications to Information 
Science and Chemical Structure 
Investigation 
Allen, Beth Elaine U_ Pennsylvania Alternative Representations of IST79- 18464 $ 77,987 07 30 79 121 
Information in Microeconomic 24 
Systems 
Braunstein, Yale Brandeis Univ. Economics of Market Struc- IST79-15235 $156,523 09 06 79 3009 
ture in Scientific and Technical 24 
Information Activites 
Chapanis, Alphonse Johns Hopkins Human Requirement: in IST78-20923 $195,841 02,05 79 066 
Weeks, Gerald Univ. Information Retrieval 24 
Cohen, Karen Massachusetts Experimental Evaluation of IST78-20362 $131,746 09 2679 Ok? 
Institute of Information Delivered in 24 
Technology Various Modes and Combina- 
tion of Modes 
Cooper, Franklin Haskins Labora- Verification of Syllable MCS79-16177 $109,475 08 30 79 347 
tories Hypotheses in Automatic 12 
Phonetic Analysis of Spoken 
English 
Cooper, William U California, Bayesian Retrieval System IST 79-17566 $ 97,125 O8 31,79 =O 
Berkeley Design 24 
Cozzens, Susan Inst. for Scientific User Requirements for Scien- IST78-21947 $ 31.421 1107/78 064 
Information Reviews 12 
Debons, Anthony UL Pittsburgh A Survey of Manpower IST 79-09524 $ 20,440 04 02 79 127 
Requirements for Scientific & 12 
Technical Communication 
Debons, Anthony UL Pittsburgh A Survey of Manpower IST79-18769 $126,874 06 20 79 128 
Requirements for Scientific & 12 
Technical Communication 
Debons, Anthony U_ Pittsburgh Support of U.S. Participation IST78-16989 $ 15,189 O8 23°79 129 
in NATO Advanced Study 12 
Institute in Information Science 
Dunn, Donald Stanford Univ. Economics of Computer MCS78-02272 = $157,172 07 01:79 14) 
Communication Networks 12 
Evens, Martha Illinois Inst. of Structuring the Lexicon and IST 79-18467 $ 43,575 O08 06.79 = (SK 
Technology the Thesaurus with Lexical- 24 
Semantic Relations 
Flowers, John UL Nebraska Human Attention and Per- IST79-11162 $ 34.672 07 30 79 100 
formance Limitations in 12 
Processing Visual Information 
Friedman, Joyce U Michigan Computer Studies in Formal BNS79-05879 = §_- 87, SOR 06 28 79 090 


Linguistics 


DIVISION OF INFORMATION SCIENCE AND TECHNOLOGY 


FY 1979 AWARDS—CONTINUED 


Award 
Principal Award Date) Summary 
Investigator(s} Institution Title No. Amount Duration No. 
{Month} 
Gell, Marilyn National Com- Electronic Information IST79-21035 $ 40,335 08 27/79 = O98 
mission on Librar- Exchange in Planning for White 7 
ies and Informa- House Conference on Library 
tion Science and Information Services 
Gerstenfeld, A. Worcester Poly The Critical Review Process: IST79-16483 $147,255 O08 24.79 159 
Franke, Richard technic Institute Impact Upon Scientific and 24 
Technological Information 
Hammond, Kenneth UU Colorado Designing Information Systems IST78-19867 $105,531 05,02,79 = 027 
Cook, Richard for More Effective Use in 12 
Taylor, Robert Decision Making 
Hayes, Robert U California, Mathematical Models of Infor- IST79-18497 $196,640 O8 14.79 O17 
Borko, Harold Los Angeles mation System Use 24 
Hickey, Thomas OCLC, Inc The Probabilistic Matching IST79-18263 $ 42,32) O8 21,79 115 
and Control of Author Names 12 
in Automated Library Systems 
Joshi, Aravind U_ Pennsylvania Computer Science and Com- MCS79-07558 $175,100 07 02/79 122 
puter Engineering Research 12 
Equipment 
Kent, Allen U_ Pittsburgh Economics of Information IST79-1 1886 $233,496 O08 13°79 134 
Transfer Using Resource 0 
Sharing Networks— Network 
Modeling Simulation 
Kidd, Jerry lL Maryland Research on Behavioral IST79-04896 $163.01) O8 22.79 079 
Variables Related to Informa- 1s 
tion Systems 
Langley, Patrick C arnegie- Mellon An Information Processing IST79- 18266 $ $3,715 07 3% 79 = =022 
Univ Theory of Procedural Learning 24 
Lehnert, Wendy Yale Univ Text Processing Effects and IST79-18463 $291 SRK O08 03°79 160 
Schank, Roger Memory Recall 24 
Abelson, Robert P 
Loy, James National Bureau Support of the Numerical Data IST79-20632 $ 12.300 08 30,79 = (097 
of Standards Advisory Board 12 
Maier, David SUNY, Stony Relational Theory of Data- IST 79-1 8264 $ 89.655 O08 O8 79 146 
Brook bases 24 
McCarroll, Jane Innovative Electronic Information IST79-18251 $ 20.275 06 06 79 061 
Research, Inc Exchange in Research on 12 
Devices for the Disabled 
Michael, Elizabeth Tymshare, Inc Innovations in Engineering IST 79-06604 $ 3.695 12 1S 78 182 
Publications 4 
Nicholf, Robert Battelle Memorial Advanced Concepts and IST79-1 1190 $148,900 09 21 79 (OS 
Inst Studies of a Vocabulary 12 
Switching System 
Parsons, Charles Columbia Uni Normal Sentence Forms and IST79-19203 $354,377 9 06 79 029 
Levi, Isaac Information Structure 24 


DIVISION OF INFORMATION SCIENCE AND TECHNOLOGY 


FY 1879 AWARDS—CONTINUED 


Award 
Principal Award Date) Summary 
Investigator(s} Institution Tule No. Amount Duration No 
{Month} 
Pearl, Judea U California, Investigating Computational MCS78-18924 = $ 68,75! 03 09/79 O18 
Los Angeles Gains Using Partial Information 12 
Perlmutter, Jane LU Texas, Austin Representation, Retrieval and IST79-18474 $ 63,952 07/25/79 150 
Reorganization of Conceptual 24 
Information 
Pollock, Joseph American Chemi- Automatic Spelling Error IST78-20175 $153,160 05 31/79 002 
cal Society Detection and Correction for 
Large Data Bases 
Reinyes, J. Francis Massachusetts A Research Program on Elec- IST7&-23251 $ 83,725 02 14/79 OR4 
Inst. of Technology tronic Document-Delivery 12 
Systems for Interlibrary Loans 
Roderer, Nancy King Research, Evaluation Measures and IST79-18479 $143,864 08/1479 307) 
Wiederkehr, Robert Inc Methods for On-Line Bibho- 1S 
graphic Systems 
Salton, Gerard Cornell Univ Mathematical Modelling in IST) 9-05301 $145,040 0$03,79 = O31 
Automatic Information 24 
Retrieval 
Salton, Gerard Cornell Univ Fast Computational Processes IST79-09176 $ 29.605 07/27/79 32 
in Information Retrieval 12 
Slamecka, Vladimir Georgia Inst. of Syntactic Structure of Infor- IST78-27002 $137,790 OF 0179 O45 
Technology mation 2d Information 24 
Processing 
Sullo, Pasquale Rensselaer Poly- A Reviability Theoretic IST7&-16414 $ 72.800 (6 22 79 135 
Wallace, Wilham techmic Institute Construct for Document Usage 12 
and Information Flow 
Tannebaum, Aaron Brooklyn College — Efficiency of Sequential Search IST79-1 7568 $ 38,376 072779 OW 
Algorithms 12 
Tou, Juhus U Florida Development of Theory and IST 79-08 S48 $166,740 On03 79 8 O89 
Techniques for Automated 4 
Know! ge Utilization, 
Transfer & Extension 
Tsao, Yao-chung Rensselaer Poly- Information Processing of IST79- 18582 $ 27.850 O08 06 79 1 
technic Institute Logographic and Alphabetic 12 
Writing Symbols 
Webber, Bonnie U_ Pennsylvania Research in Natural Language IST79-18476 S$ 83,S9R OR 0179 123 
Man-Machine Interaction 24 
Werdel, Judith National Academy — International Scientific and INT79-0293) $ 96.500 03 06 79 09s 
of Sciences Technical Information Programs 12 
Wood, James Council of ANSI Committee 239 IST78-25323 $ 20,000 08/10/79 = aa 
Frase, Robert National Library 12 
Association 
Yovits, Marshall Ohio State Uni Extension & Applications of a IST 79-08 327 $105,670 07/16/79 117 
Theory of Information Flow 24 


and Analysis: The Use of 
Information in « Decision- 
Making Environment 


FY 1979 AW ARDS—CONTINUED 
DIVISION OF INFORMATION SCIENCE AND TECHNOLOGY 


Award 
Principal Award Date Summan 
Investigator(s) Institution Title No Amount Duration No 
{Month} 
Zadeh, Lotfi U Cahfornia, A Theory of Approximate MCS79-06543 = $_-29,000 0% 09,79 O14 
Berkeley Reasoning 12 
Zunde, Pranas Georgia Institute Study of Potential New Appili- IST 79-17567 © 82.449 09 10 79 046 
of Technology cations of Information Theory 12 
to Problems of Information 
Science 


FY 1980 AWARDS 
DIVISION OF INFORMATION SCIENCE AND TECHNOLOGY 


Award 
Principal Award Date Summary 
Investigator(s) Institution Title No. Amount Duration No. 
{Month} 
Allen, James F UL Rochester Recognizing Intent in Natural ISTSO-12418 $ 80,000 O08 15/80 137 
Language 24 
Anderson, James Brown L nov Cognitive Applications of BNS79-23000 = § 44,602 03/31/80 3 Ol; 
Matrix Memory Models 2 
Becker, David S IIT Research Inst. An Augmented English IST79-20267 $ 83.285 0626 80 050 
Dictionary For Information 12 
Retrieval 
Bernard, H. Russell UL Flornda Degradation of Accuracy in IST79-24965 $ 19,403 12/18/79 + 040 
Social Network Data 12 
Brown, Donna J U_ IMhnois, Storage and Access Costs in IST80-12240 §=$ 62,663 68/14/80 53 
Urbana Data Structures for Informa- 24 
tion Retrieval Problems 
Chase, Karen H Pennsylvania State Completions of Relations and IST80-12239 $ 36.6380 06 / 26 /80 124 
Univ Embedded Join Dependencies 12 
in Relational Databases: A 
Study Using Boolean Matrices 
Chu, Wesley W U Calhforma, File Allocation and Query IST80-02671 $112,220 07/17 80 =6019 
Los Angeles Opturmization for Distributed 24 
Data Bases 
Croft, W. Bruce L Massachusetts, A Generalized Model of IST8O-1 1605 $ 61,740 06/27/80 OB! 
Amberst Document Retrieval 4 
Dunn, Donald Stanford Unis Economics of Computer MCS80-12875 $166,458 08 01/80 142 
Communication Networks 12 
Eastman, Caroline Flonda State Muluple Attribute Retrieval IST80-1 1608 $ 24,063 08 07 80 864 
Univ Techniques for Tree-Structured 12 
Information 
Ellis, Carla § L Oregon Concurrency in the Access and ISTSO-; 244] $ 62,500 06, 18/80 118 
Manipulation of Information 24 
Structures 
F almagne. New York Uni Probabilistic Measurement IST79-24526 $156,909 04 /02 /80 103 
Jean-C laude Theoretic Models of Informa- AD 
tion Processing 
Fu. King-Sun Purdue Unis Research on Error-Correcting IST 79-1 8884 $151,794 11/05/80 132 
Techniques in Automated % 
Information Processing 
Grossberg, Stephen Boston Univ Decision-Making and Infor- IST80-00257 $157,990 04/02/80 O08 
mation Processing in Real- % 
Time Network Models 
Harris, Zelhg Columia Univ Information Correlates of ISTRO-17043 $ 74,989 09/23/80 O28 
Basic Language Structures 24 
Haseman, Wiliam D — Carnegie-Mellon Distributed Data Base Design IST79-18472 $ 9.211 12/07/79 = 023 
Uni 6 
Haseman, Wilham DU Wisconsin, Distributed Data Base Design IST&O-19259 $243,651 08 04/80 158 
Milwaukee 18 


—- 


DIVISION OF INFORMATION SCIENCE AND TECHNOLOGY 


FY 1980 AWARDS—CONTINUED 


a Award 
Principal Award Date Summan 
Investigator(s) Institution Title No. Amount Durztion No 
{Month} 
Hillman, Donald Lehigh Univ. A Theoretical Basis for ISTSO-12234 $123,551 06 19 80 74 
Knowledge Management 24 
Hiltz, S. Roxanne Upsala College Workshop to Compare and ISTS8O- 18077 $ 43.711 06 10 80 133 
Synthesize Evaluations of 6 
Computer-Based Information 
Exchange 
Huber, George LU Wisconsin, A Study of the Distribution, IST79- 1 3382 5 91.214 01 23 80 187 
Madison Processing, and Use of Scientific 12 
and Technical Information in 
Research Organizations 
Isenhour, Thomas U/North Carolina) Computer Applications to CHES0-40035 = -$ 66,500 06 11 80 iti 
Analytical Chemistry 12 
Kaske, Neal K. OCLC, Inc Development of Generalized IST79-20312 $ 97.902 OS 22 80 116 
Algorithm to Determine 12 
Terminal Requirements for 
On-Line Catalogs in Libraries 
Knuth, Donald E. S : Theoretical Basis for the IST79-21977 =. $3.48, 707 01 23 80 143 
Development of Document 4 
Preparation Systems 
Kochen, Manfred U / Michigan Dynamics in the Organization ISTRO-07433 $134,782 Us 07 80 (> 
of Scientific Specialities 24 
Koll, Matthew B. National Bureau Refinement and Use of Con- ISTSO-1 1607 $ 35.49% OY OL 80 (96 
of Standards cept Similarities to Informa- i2 
tion Retrieval 
Leimkuhier, Purdue Univ. Analysis and Application of IST79-1 1893 $108,099 01 16 80 133 
Ferdinand F. Information Productivity 4 
Models 
Leimkuhler, Purdue Univ. Analysis and Application of ISTRO- 12834 $ 1.819 06 06 80 134 
Ferdinand F. Information Productivity 0 
Models 
Machlup, Fritz New York Univ. Information Science: An ISTSO-08007 $ 39.973 OS 27 80 10s 
Analysis of Methodological 12 
Issues and Interdisciplinary 
Relationships 
Marcus, Richard S. Massachusetts Investigations of Computer- ISTS80-G65 16 $155,360 06 11 80 ORG 
Reintyes, J. Francis Institute of Aided Document Search 4 
Technology Strategies 
McDonough, Carol U / Lowell A Simultaneous Equation IST79-19478 $ 28,342 0S 09 80 
Model of the Demand for 12 
Academic and Professional 
Journal 
McGill, Michael Syracuse Univ. A Study of the Impact of IST79-21468 $155,613 02 OS 80 149 
Representations in Information 24 
Retrieval Systems 
Minker, Jack U / Maryland Applications of Logic and MCS79-194'8 $207,835 04,18 80 ORO 
Theorem Proving to Databases % 


Jacobs, Barry 
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DIVISION OF INFORMATION SCIENCE AND TECHNOLOGY 


FY 1980 AWARDS—CONTINUED 


Award 
Principal Award Date; Summary 
Investigator(s) Institution Title No. Amount Duration No. 
{Month} 
Moss, Mitchell L. New York Univ. Publication of Proceedings of a IST80-05145 $ 9,976 04 /08 /80 106 
Conference on Telecommuni- 12 
cations and Productivity 
Mount, Kenneth Northwestern Message Spaces and Compui- IST80-07457 $ 78,409 08 /07 /80 113 
Univ. ing Time in Performance 24 
Allocation Processes 
Moyne, John Queens College Language and Human Infor- IST80-23585 $ 91,140 05 / 30/80 025 
of CUNY mation Processing 15 
Narens, Louis E. U /California, Problems in the Theory of IST79-24019 $111,264 03/11/80 O16 
Irvine Measurement that Pertain to 36 
Information Concepts 
Neff, Karen J. Hofstra Univ. Research on Automated IST80-06945 $110,633 08 /07 /80 048 
Natural Language Generation 24 
Noma, Elliot Institute for Scien- Citation-Based Measures of IST79-23661 $ 38,188 03/06/80 065 
tific Information Interactivity and Similarity 12 
O'Connor, John Lehigh Univ. Citing Statements: Recognition IST80-04353 $115,447 07/22/80 075 
by Computer and Use to 24 
Improve Full-Text Retrieval 
Parker, D. Stott U /California, Data Semantics and Logical IST80-12419 $ 57,489 07/30/80 + 021 
Los Angeles Database Design 24 
Pearl, Judea U /California, Studies in the Organization of IST79-22062 $ 39.174 03/21/80 020 
Dalkey, Norman Los Angeles Information Structures 18 
Peters, Alec Frankiin Institute Support for Coordination of IST79-24639 $ 50,000 12/04/79 043 
Research Scientific and Technical 7 
Laboratories Information Activities 
Pierrehumbert, Janet Massachusetts The Use of Intonation in IST80-12248 $ 43,701 07/03/80 O83 
Institute of Automatic Speech 24 
Technology Understanding 
Puri, Madan L. Indiana Univ. Problems in Decision-Making IST79-18468 $112,690 12/05/79 060 
and Information Retrieval 36 
Studied with Techniques of 
Fuzzy Sets and Systems 
Reintjes, J. Francis Massachusetts Investigations of Electronic IST80- 1664 $ 95,000 06/11/80 O87 
Institute of Interlibrary Resource-Sharing 12 
Technology Networks 
Rich, Elaine U/Texas, Austin Individual User Modeling in IST80-11614 $ 56,315 08 /04 /80 1S] 
Computer Systems 24 
Richards, Whitman Massachusetts Natural Computation and MCS79-23110 $234,196 04/11/80 O88 
Institute of Control 12 
Technology 
Rouse, William B. U/Illinois, Modeling of Human Behavior IST79-16471 $144,120 12/21/79 054 
Urbana/ in Seeking & Generating 24 
Champaign Information 
Sager, Naomi New York Univ. Computer-Based Investigation IST 79-20788 $274.645 01/25/80 109 
into the Structure of 24 


Information 
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FY 1980 AWARDS—CONTINUED 
DIVISION OF INFORMATION SCIENCE AND TECHNOLOGY 


Award 
Principal Award Date/ Summary 
Investigator(s} Institution Title No. Amount Duration No. 
(Month) 
Schank, Roger Yale Univ. Theoretical Foundation of IST80-17790 $183,785 09/17/80 161 
Natural Language Processing 24 
Srihari, Sargur N. SUNY, Buffalo Contextual Algorithms for IST80- 10830 $ 55,800 07/10/80 145 
Text Recognition 24 
Swamy, Sowmitri U/Illinois, Access Time Versus Storage IST80-12242 $ 59,829 06/26/80 057 
Urbana Space in Information Retrieval 24 
Systems 
Tenenhaus, Michael Wayne State Univ. Lexical Ambiguity Resolution IST80- 12439 $38,977 08 /25/80 155 
in Discourse 24 
Turoff, Murray New Jersey Electronic Information IST80-09260 $ 2,000 03/28/80 102 
Institute of Exchange Test Facility 0 
Technology 
Ullman, Jeffrey Dept. of Air Force Implementation of a Simple IST80-21358 $ 49.487 0812/80 O01 
User Model by a Relational 24 
Database 
Wainer, Howard Bureau of Social Social Science Applications of SES80-0848 | $ 40,728 05/21/80 O12 
Science Research Dynamic Multivariate Displays 12 
Warren, David SUNY, Siony Montague Grammar & Data- IST80-10834 $ 79,902 08 /06 80 147 
Brook base 24 
Weischedel, Ralph U/Delaware Natural Language Processing IST80-09673 $136,681 08/04/80 035 
Systems and Ill-Formed Inpui 24 
Whitehead, Bruce U/Louisville Representation and Processing IST80-11617 $ 77,840 09/05/80 077 
of Spatially Structured Infor- 24 
mation in Cognition 
Williams, Martha U/Illinois, Automatic Duplicate Detection IST79-21018 $269,303 02/13/80 056 
Urbana/ of Journal Articles Appearing 24 
Champaign in Multiple Data Bases—A 
Feasibility Study 
Zadeh, Lotfi A. U/California, A Theory of Approximate MCS79-06543 = $ 29,500 04/18/80 O15 


Berkeley 
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SUMMARIES OF AWARDS 
FISCAL YEARS 1978, 1979, 1980 


#00 | 


Implementation of a Simple User Mode! by a Relational 
Database. Department of the Air Force, Bolling Air Force 
Base, Washington, DC 20332. Jeffrey Ullman. $49,487 for 
24 months beginning August 12, 1980. 


searchers 2 data base and proven methods which 
ca 1 be generalized to a variety of significant infor- 
mation science questions concerned with the iden- 
tification of semantic content in terms of syntactic 
form, including applications to devices which 
produce text outputs in response to speech inputs. 

Data on spelling errors will be gathered. Word 
classification procedures to identify and separate 
vocabularies (e.g.. scientific English, taxonomic 
terms, and systematic chemica! names) will be 
investigated, as will word structure analysis tech- 
niques. The applicability of n-gram analysis to 
spelling error detection will be determined. 

A dictionary of repeated misspellings in the 
experimenial corpora will be compiled and the 
proportion of the total misspellings that can be 
corrected using this dictionary will be determined. 
Correction heuristics using co-occurrence data at 
several levels, and the usefulness of n-gram analysis in 
conjunction with word classification will be studied. 


It is proposed to build an information retrieval 
system based on relational database theory, which 
would be both efficient and easy to use. 

It is a great convenience for the user if a database 
can be viewed as a single (universal) table, and queries 
are posed about this imaginary table. However, it 
is generally too expensive of storage space to allow 
the redundance that would necessarily be present 
in such a table. Thus, relational databases usually 
consist of several tables, and queries are posed in 
terms of combinations of these tables. This under- 
lying database structure should be made invisible 
to the user, but several problems are encountered. 
One is the design problem: what tables should be 
used to store the database? A second is query 
interpretation: what access paths (connections 
between data in different tables) are implied in 
queries about the imaginary universal table? A third 
is Optimization: how to optimize the query on the 
underlying tables? A fourth problem to consider is 
how updates are to be performed. The user may 
attempt to insert or delete associations from the 
universal table that are not reflected naturally in 
the underlying tables. 


#003 

Development and Testing of a Methodology to Assess Private 
Investment for Scientific and Technical Information. Applied 
Communication Research, Inc., P. O. Box 5849, Stanford, 
CA 94305. Colin K. Mick. $103,350 for 12 months beginning 
February 13, 1978. 


The objective of this project is to develop a 
conceptual framework to define STI expenditures 


The project, in brief, will produce a system capable 
of (a) interpreting queries on the universal table as 
if they were queries on the underlying database and 
(b} assisting the database designer in selecting the 
underlying tables and in specifying the rules whereby 
query interpretation can be conducted correctly. 
Appropriate theory will be developed where needed. 


in industry and to develop and test a methodology 
for collecting these data. Because of the conceptual 
and definitional problems, as well as the difficulties of 
actually providing such data, close working relation- 
ships will be established with representatives of all 
companies to be surveyed. A task force of industry 
representatives will also be established to provide 


guidance and input to the researchers. The survey 
sample will be based upon over 500 companies 
included in the annual Business Week analysis of 
corporate investment in research and development. 


#002 

Automatic Spelling Error Detection and Correction for Large 
Data Bases. American Chemical Society, P. O. Box 3012, 
Columbus, OH 43210. Joseph J. Pollock. $153,160 for 30 
months beginning May 31, 1979. 


Chemical Abstracts Service (CAS) will undertake a 
30-month research program in automatic detection 
and correction of spe!ling errors in free text. The 
proposed research has value for broader questions 
than the correction of spelling errors in the CAS 
file; it is expected to make available to other re- 


#004 


Assessing the Information Behavior of Scientists and Engi- 
neers—A Secondary Data Analysis. Applied Communication 
Research, Inc., P.O. Box 5849, Stanford, CA 94305. Colin K. 
Mick. $69,309 for 12 months beginning June 29, 1978. 


The purpose of this study is to explore the infor- 
mation behavior of scientists and engineers utilizing 
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recently collected data. Since no new data collection 
is proposed for this project, the emphasis will be 
on identifying and testing existing information 
behavior hypotheses and on identifying and exploring 
relationships between information behaviors and 
between those behaviors and contributing factors 
such as individual attitudes, environmental con- 
Straints and situational (task) constraints. 

Products of this study will include a series of 
models for predicting information behaviors, an 
annotated listing of information behavior hypotheses, 
and an annotated bibliography of the information 
behavior research literature. 


Battelle project is to develop, test and evaluate a 
technique which will permit the user to search on- 
line computerized data bases outside his subject field. 
The proposed computer procedure will accept the 
user's own, familiar subject terms, correlate and 
convert them to the indexing terms used in other 
subject fields. This will permit the retrieval of 
documents on a given subject from all available 
data bases with a single query. 

The project's investigator plans to expand his 
earlier investigations in indexing energy information. 
Previously, from selected energy related terms used 
by a group of computerized data bases, he developed 
a preliminary, on-line subject switching technique 
for energy. To the original energy vocabulary, he 


#005 will add new terms, refine the original software, 
Advanced Concepts and Studies of a Vocabulary Switching and attempt to determine whether the success 
System. Battelle Memorial Institute, 505 King Avenue, obtained in energy can also be achieved in a much 
Columbus, OH 43201. Robert T. Niehoff. $148,900 for 12 larger group of fields. 

months beginning September 21, 1979. The result will be a better understanding of how 


There are many bibliographic data bases available 
for on-line searching. Using these data bases to locate 
information for an interdisciplinary project is very 
difficult because each data base uses the specific 
vocabulary to its subject discipline which may not 
be familiar to the researcher. A vocabulary switching 
system (VSS) has been developed which will accept 
a user's search terms and convert them to the 
indexing terms used in other fields. This permits 
the retrieval of documents on a given subject from 
several data bases with a single query. The VSS, 
which was originally developed for energy terms, 
has been extended and now can be used to correlate 
terms from several large controlled vocabularies. 
This project will investigate subject switching between 
controlled and uncontrolled vocabularies. The effects 
of variables such as the similarity or dissimiliarity 
of data bases, size and number of data bases, and 
the similarity or dissimilarity of the subject fields 
will be evaluated to determine the circumstances 
under which subject switching provides the greatest 
benefits. The role of subject switching in an inter- 
mediary-free search environment will also be 
investigated. 


to transfer information from computerized data bases 
to the end user. 


#007 


Measurement and Improvement of On-Line Chemical Searching 
Performance in Three User Groups at Small Colleges. 
Bethune-Cookman College, Daytona Beach, FL 32015. 
Richard F. Copeland. $89,535 for 18 months beginning 
June 13, 1978. 


The high fixed minimum cost of maintaining a 
traditional library of bound scientific and technical 
journals is more than many small undergraduate 
colleges are capable of supporting. Significant savings 
have been obtained by substituting on-line searching 
for abstract journal subscriptions. This suggests that 
the 1100 to 1300 such institutions may well be using 
on-line searches as their primary entry into the 
scientific and technical literature within the next 
few years. An increasing portion of this on-line 
bibliographic searching will be done by inexperi- 
enced, infrequent users—college faculty and students 
in the case of undergraduate colleges. The relative 
proficiency of these users has not been established. 
Phase I of the project will study the searching 


#006 techniques and quantify the proficiencies of three 


; distinct groups: information scientists who are 
Extension, Refinement and Evaluation of On- Automated full-time on-line searchers, college chemistry faculty 
Subject Switching. Battelle Memorial Institute, 50S King 


> who are part-time searchers, and novice searchers. 
Avenue, Columbus, OH 43201. Robert T. Niehoff. $106,186 Based on the results of Phase I. Phase II will 
for 12 months beginning May 26, 1978. 


On-line searching of multiple data bases for 
bibliographic records is extremely time consuming 
and very costly. This is due to the diversity of subject 
indexing procedures and vocabularies used. These 
create a barrier to greater utilization of existing 
information resources. The main purpose of the 
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investigate whether an on-line prompting system 
can significantly improve the absolute searching 
efficiency of any or all of the three groups of 
searchers. Recall, precision, and search time and 
cost are the variables to be measured. Randomized. 
complete block designs will be used for standard 
analyses of variance. 


#008 This research will examine the efficiency of a class 
of sequential search algorithms in which an item 
which is accessed is moved to a forward position 
on the list and will need fewer searches. The effects 


Decision-Making and Information Processing in Real-Time 
Network Models. Boston University, Department of Mathe- 
matics and Biomedical Engineering, Boston, MA 02215. 


Stephen Grossberg. $157,990 for 36 months beginning April of various parameters will be examined. These 
2, 1980. parameters include the number of elements on the 
list and number of searches. Optimal points at which 
The proposed research is an investigation of the to switch from one algorithm to another will be 
properties of rapid parallel decision-making and sdentified. 
stable self-organization of environmentally sensitive 
codes that occur in network models which have pre- #011 


viously been derived from psychological postulates. 

A key project is the study of absolutely stable 
pattern transformers, including their short term 
storage, choice behavior, and self-tuning capabilities. 
Additional work will deal with normative drifts and 


Cognitive Applications of Matrix Memory Models. Brovn 
University, Department of Psychology and Center for Neural 
Sciences, Providence, RI 02912. John A. Anderson. $44,602 
for 12 months beginning March 31, 1980. 


pattern completion, the functional units of cog- 
nitive coding, and oscillatory behavior in parallel 
processors. 

The work will draw upon the investigator's con- 
tinuing work on a general method for classifying 
the dynamics of competitive systems. This method, 
which is based on differential equations, derives 
from his previous work on the neural basis of 
cognitive mechanisms. It provides a tool for de- 
Signing, analyzing, and classifying competitive 
systems that have desirable biological behavior. The 
results have potential for application to the repre- 
sentation of knowledge in both human and electronic 
information processing systems. 


#009 


Economics of Market Structure in Scientific and Technical 
Information Activities. Brandeis University. Waltham, MA 
02154. Yale M. Braunstein. $156,523 for 24 months begin- 
ning September 6, 1979. 


This research is designed to examine both the 
market structure of the Scientific and Technical 
Information (STI) industries and related economic 
phenomena. The project combines economic theory 
and empirical testing with the objective of con- 
tributing to our understanding of such issues as 


A model for memory based on ideas from 
neuroscience has been developed and published 
previously. It is assumed that (a) nervous system 
activity is most usefully represented as a set of 
simultaneous individual neuron activities in a group 
of neurons (a state vector) and (b) changes in synaptic 
strength of a particular (correlational) kind store 
memory forming a matrix of synaptic connectivities 
which can couple associated state vectors. Recent 
extensions of this model involved positive feedback of 
a set of neurons on itself and introduction of a simple 
limiting non-linearity. This work will develop this 
model further. Such models seem to be very useful 
for describing human perceptual processes quite close 
to the sensory input. Non-linear extensions, to be 
investigated, seem to be able to make significant 
contact with questions of more cognitive relevance 
such as categorization, logical inference, and semantic 
networks. Such models may have considerable 
practical use since they are able to generalize. 
associate and categorize, yet are immune to certain 
pathologies of large systems: that is, since there are 
a fixed number of modifiable synapses operating 
in parallel, there is little possibility of the kinds of 
exponential retrieval and storage disasters that haunt 
data bases of a very large size. 


pricing, market entry, vertical integration and size #012 


of firm. Four specific areas of research will be 
addressed. These include the development of cost Social Science Applications of Dynamic Multivariate Displays. 
Bureau of Social Science Research, 1990 M Street, NW, 


functions for firms engaged in different phases of ore : _ . : 
the dissemination of STI; measurement and analysis Washington, DC 20036. Howard Wainer. $40,728 for 12 
months beginning May 21, 1980. 


of technological change over time; examination of 


the motivation of the firm in the STI industries; A newly developed method for the interactive 
and a comprehensive analysis of the structure of display of multivariate data (PRIM-9) will be used 

the STI industries. to examine diverse sets of social data chosen for 
their generality of form. The results will be compared 

#010 to results obtained using static graphical display 
Efficiency of Sequential Search Algorithms. Brooklyn College, methods. The aim is to assess the value of the new 
Department of Computer & Information Science, Brooklyn, method for detecting structure and relationships in 
NY 11210. Aaron Tennebaum. $38,376 for 12 months data of great complexity. Transportable data sets 
beginning July 27, 1979. representative of broad classes of social data will 
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be prepared for manipulatica at the one existing 
PRIM-9 facility in the U.S. These data will be drawn 
from bodies of data widely used in economic, 
demographic, and social indicators research, and 
they will include several levels of geographic aggre- 
gation, individual-level data taken from field ex- 
periments and social surveys, and simulated data 
of known (constructed) multivariate structure. 
The PRIM-9 system's utility in revealing compiex 
interrelationships, generating hypotheses about 
causal structures, and testing explicit theoretical 
propositions will be examined. The results will be 
recorded on video tape and written into a description 
and evaluation of PRIM-9 and PRIM-9-like systems 
for social science data analysis. Formal and informal 
means of reporting will be employed to inform the 
social science research community of the effectiveness 
of this new technology and to provide feedback to 
the originators and develcpers of PRIM-9. Such 
reports will include suggestions for modifications 
and improvements in the system which would 
increase the analytic power available to the research 
community. 


This research is aimed at the development of a 
method of inference from imprecise or unreliable 
daia which are expressed in the form of propositions 
in a natural or synthetic language. The main appli- 
cation areas for this method lie in the design of 
expert question-answering systems, decision analysis, 
and control of robotic systems operaiing in an 
imprecisely defined environment. 

The approach of this research is based on the 
representation of the meaning of an imprecise 
propositicn by a procedure which acts on a set of 
relations in a database and returns a possibility 
distribution for the variables which are associated 
with the proposition. The inference rules are applied 
to the translation of the premises and yield a 
possibility distribution which upon retranslation 
becomes an imprecise conclusion inferred from the 
premises. In general, the premises are allowed to 
be qualified with imprecisely defined truth values, 
probability values or possibility values. 


#015 


A Theory of Approximate Reasoning. University of California, 
Electronics Research Laboratory, Berkeley, CA 94720. Lotfi 


#0! 3 A. Zadeh. $29,500 for 12 months beginning April i8, 1980. 
Bayesian Retrieval System Design. University of California, This supplements work described in the above 
School of Library & Information Studies, 118 South Hall, project. 
Berkeley, CA 94720. William S. Cooper. $97,125 for 24 
months beginning August 31, 1979. #016 
The aim of this research is to develop mathematical Problems in the Theory of Measurement that Pertain to 
and statistical techniques for the design of more Information Concepts. University of California, School of 
effective information retrieval systems. The emphasis Social Sciences, Irvine, CA 92717. Louis E. Narens. $111,264 
is on improving the underlying indexing and search for 36 months beginning March 11, 1980. 


logic used in document and reference retrieval 
systems rather than on developing more sophisticated 
hardware. 

Retrieval will be treated as a problem in proba- 
bility estimation. A retrieval system should be able 
to determine which documents in a collection have 
the highest probability of usefulness, and to retrieve 
the documents in that order. Such probabilities can 
br best estimated on the basis of a combination of 
several types of data, and the methods of Bayesian 
statistics will be used to find practical ways of 
combining such evidence to perform the estimation 
within the computer. Estimation formulas will be 
developed for various types of search requests and 
for a variety of standard and novel statistical clue- 
types. The mathematical basis of a possible “brows- 
ing’ capability for advanced systems will also be 


The term “measurement,” as used here, refers to 
the study of scientifically interesting collections of 
objects or events whose elements exhibit some 
attribute that can be ordered quantitatively. That 
is, given any two elements of the collection, it is 
possible 10 determine by some well prescribed 
procedure which of the two has more of the attribute 
or if they exhibit it to the same degree. From a 
scientific point of view, the most difficult problem 
usually is how to device systematic ways of dis- 
covering the ordering of the attribute in question. 

The purpose of this grant is to work on some 
mathematical problems that appear to be useful in 
the measurement and the understanding of aspects 
of cognition that are relevant to principles of 
information science. These can be grouped into three 
major headings having to do with increasingly 


meEgaae. complex levels of structure: (1) Structures with one 
operation: concatenation and conjoint ones; (2) 

#Ol4 Structures with two operations: probability and 
A Theory of Approximate Reasoning. University of California, random variables; (3) Structures with many opera- 
Electronics Research Laboratory, Berkeley, CA 94720. Lotfi tions: dimensional analysis and meaningfulness. Such 
A. Zadeh. $29,000 for 12 months beginning August 9, 1979. studies of the fundamental structure of information 


At 
16 


are concerned with issues common to the empirica! 90024. Wesley W. Chu. $112,220 for 24 months beginning 
aspects of all sciences. July i 7, 1980. 

The research will include work on some of the 
important problems in the axiomatic foundations 
of measurement theory, particularly those that 
pertain to the concepts of probability, utility, risk, 
and information. 


The advent of computer network and micro- 
processor technology has established a technological 
foundation for implementing distributed computing 
and distributed databases on a computer network. 
Linking remote databases together not only pro- 
vides more economical service but also allows for 


#017 real time retrieval, update, and distribution of 
Mathematical Models of Information System Use. l niversity of information. 

California, Graduate School of Library & Information Using a dynamic programming approach, based 
Science, 120 Poweil Library Building, 405 Hilgard Avenue, on query operation graphs, which provides for 
Los Angeles, CA 90024. Robert M. Hayes and Harold Borko. progressive elimination of undesirable strategies. 
$196,640 for 24 months beginning August 14, 1979. the investigator proposes to design an algorithm 


which finds the optimal file allocation and query 
processing strategy and requires a relatively small 
amount of computation compared to the compu- 
tation required to evaluate all possible strategies. 
This algorithm will consist of two phases: the first 
phase involves determining the values for parameters 
such as query frequencies, file sizes, communication 
costs, and processing costs; the second phase involves 
making the sequence of decisions that leads to the 
optimal strategy. 


The purpose of this research is to examine four 
issues relating to evaluation of information system 
usage. Each issue is described in terms of a mathe- 
matical model which wiil be considered as hypothesis 
to be tested on data descriptive of various kinds of 
information systems. The first deals with usage as 
a function of allocation of resources, the second 
with distribution of usage, the third with significance 
of usage, and the fourth with distance and usage. 

The project will list the hypotheses, validate the 


models, and analyze the consequences of them. Two > 

#020 
phases are involved. In the first, the problems in - 
data acquisition will be identified, methods for data Studies in the Organization of Information Structures. (ni- 


versity of California, School of Engineering and Applied 
Science, Hilgard Avenue, Los Angeles, CA 90024. Judea 
Pearl and Norman C. Dalkey. $39.174 for 18 months 
beginning March 21, 1980. 


acquisition will be pilot tested, and the underlying 
mathematics will be analyzed; in the second, the 
four hypotheses will be tested and the mathematical 
consequences will be explored toward the aim of 


developing design equations. lt is proposed to study a statistical model of 
information systems, conceived in terms of proba- 

#018 bilistic linkage between possible reports (data, 
Investigating Computational Gains Using Partial Information. messages, or other information) and events to be 


predicted. Typical information systems are large 
and complex, with much marginally relevant data. 
The proposed work is directed toward simplifying 
and modifying systems in such a way that decision- 
The primary objective of this project is to develop a making becomes manageable and yet remains 


University of California, Engineering Systems Department. 
405 Hilgard Avenue, Los Angeles, CA 90024. Judea Pearl. 
$68.751 for 12 months beginning March 9, 1979. 


quantitative model describing the exchange between 
complexity and precision in non-numerical computer 
systems. This is one aspect of the general problem 
of representing information by “abstracts” which 
compensate for their incompleteness by their com- 
pactness, measured by the number of bits required 
to store the data in memory. This research addresses 
the general conditions under which a small impre- 
cision can result in substantial computational savings, 
to quantify the laxity-complexity exchanges in 
practical computer systems. 


minimally degraded by the approximating trans 
formations. Emphasis will be on procedures aimed 
at reducing the amount of information weeded for 
decisions; extending capabilities for using partial 
or incomplete information; and clarifying the role 
of judgment in decision. 


#02) 
Data Semantics and Logical Database Design. University of 
California, School of Engineering and Applied Science. 


Hilgard Avenue, Los Angeles, CA 90024. D. Stott Parker. 
$57,489 for 24 months beginning July 30, 1980. 


#019 With the increasing need for organized infor- 
File Allocation and Query Optimization for Distributed Data mation systems, a great deal of attention has been 
Bases. University of California, School of Engineering focussed recently on the problems of database design. 
and Applied Science, Hilgard Avenue, Los Angeles, CA Typically a design process will proceed through three 
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phases: (i) analysis and specification of user re- 
quirements, (ii) logical design of a database, and 
(iii) physical design for the logical database. The 
second of these, logical design, plays 2 key role in 
enabling the computer to meet user needs compre- 
hensibly and efficiently in a changing information 
environment. Logical design of relational databases 
involves the development of a schema, or collection 
of tables, in which user data may be accurately 
stored. The measure of “accuracy” is the extent to 
which the schema reflects the data semantics, and 
permits the reconstruction, through joining tables 
together, of the original data. 

This research concentrates on several topics 
currently of central importance in logical design: 
(1) data dependencies and normal forms, (2) design 
and implementation of relational views, (3) im- 
provement or enhancement of the relational data 
model, and (4) development of effective method- 
ologies for logical design. Project goals include 
unification of the data dependency field, resolution of 
several basic problems in the relational model (the 


the historical economies to scale advantages. The 
area which this proposal addresses is that of designing 
and distributing database structures over these 
networks which are good or “optimal” in some sense. 

This project will develop a methodology and 
model for designing distributed databases. The model 
is initially viewed as a two-stage process involving 
first the design of the database and then the distri- 
bution of that database. The design of the database is 
accomplished using an integer program which 
minimizes the sum of access and storage cost. The 
distribution process uses a location model for 
determining the number of copies of each file and 
then their location. Initially these models will be 
solved sequentially in an interactive manner. Since 
this will only lead te a local optimum, it is then 
planned to use the knowledge gained to develop a 
single model which will combine both steps into 
one. Solution techniques for this model will be 
developed and both models will be tested using a 
medical database example. 


use of null values in tabies, the loss of semantic #024 

information connecting tables in the schema, and Theoretical and Experimental Investigation of the Dynamics 
the ambiguity problems of multiple access paths of Information. Case Wester Reserve University, Cleve- 
between tables}, and development of dependency- land, OH 44106. William Goffman. $71,588 for 24 months 
based schema design algorithms. beginning April 12, 1978. 


The purpose of this research is to attempt to 
establish some of the fundamental principles which 
govern the beha’ or of communication processes 
and their associated information systems. It will 
have theoretical as well as practical implications. 
The theoretical portion of the research will deal 
with structural and dynamic properties of infor- 
mation transfer. 

The goals of the structural aspect is to achieve a 
subject classification which ts detined by the Interature 
content rather than by the preconceived notions of 
a group of indexers and classifiers. This would be 
especially important for emerging disciplines and 
interdisciplinary fields where no classical structure 
exists. This project will use the mathematics of graph 
theory to establish the theoretical basis for imposing 
this type of structure on a population of information 
conveying clements such as journal articles. 
#023 The dynamic properties of communication among 


Distributed Data Base Design. Carnegie-Mellon University, the members of a population such as scientists have 


been shown to behave with certain differences in 
School of Urban and Public Affairs, 500 Forbes Avenue, , 

accord: th an epidemic model such ; : 
Pittsburgh, PA 18213. William D. Haseman. $39,211 for 6 on ee . a ere 


the spread of infectious diseases. This proyect will 
months beginning December 7, 1979. SP ous s projec 


” 


ee 


Aw jaformation Processing Theory of Procedurz: Learning. 
{ wrnegie- Mellon University, Department of Psychology, 
Poiisburgh, PA 15213. Patrick W. Langley. $53,715 for 24 
manths beginning July 30, 1979. 


The purpose of this research is to develop the 
framework for artificial systems which are capable 
of learning procedures. The developmen: of such 
systems would preclude the present necessity for 
encoding large knowledge bases manually and thus 
make it economically feasible to produce information 
processing systems which would exhibit an “intel- 
ligence™ not currently possible. Such systems would 
have the capability of automatically identifying the 
inirinsic knowledge content of large stores of 
information. 


examine these differences and develop an integrated 
Recent advances in computer technology, both model in which direct contact and intermediate host 


communication can take place simultaneously. The 
various relationships will be described in terms of 
differential equations. 

The practical implications will derive from testing 
the theoretical findings experimentally on a sample of 


hardware and software, have brought about a rapid 
change in how organizations collect, process and 
store data. Many of the large central computer 
facilities are giving way to clusters of smaller com- 
puters as rapid decreases in hardware prices eliminate 


18 


articles from the Annual Review of Information 
Science and Technology. Information science qualifies 
aS an emerging interdisciplinary area. In addition, 
its selection as the demonstration case will permit 
an examination of the field in terms of its future 
directions and boundaries. 


#025 

Language and Human Information Processing. City University 
of New York, Queens College, Department of Computer 
Science, 65-30 Kissena Boulevard, Flushing, NY 11367. 
John Moyne. $91,140 for 15 months beginning May 30, 1980. 


This research is concerned with the communication 
processes which are involved in the transfer of 
knowledge from databases to users. An information 
bank or database is viewed as a body of knowledge 
which must be analyzed, segmented, classified and 
understood to be of any use. The instrument of 
communication which can specify what piece or 
pieces of information are needed from the database ts 
called a language. The research is concerned with 
comprehension of the content of the database and 
the comprehension of the language. ft is concerned 
with ih= organization and structure of ’« database 
and the ‘techniques for the retrieval of *’e data only to 
the extent that they are relevant to the process of 
comprehension. 

The investigator will examine tissues in the theory 
of the nature, organization, and processing of 
information and the complexity of models of com- 
prehension. He will survey the state-of-the-art and 
develop consistent formalizations of significant 
existing models in order to provide an apparatus 
to measure the potentials of such models and their 
claims to generality with regard to the psychological 
reality of -smprehension and communication. The 
goal is to develop a universal family of specialized 
languages suitable for information processing. 


#026 


Analysis of the Operational Parameters for the Management 
of an Academic Library. Clarkson College of Technology. 
Potsdam, NY 13676. Michael Bommer and Ronald Chorba. 
$76,152 for 12 months beginning August 30, 1978. 


This 12-month project aims to develop a set of 
operational parameters suitable for performance 
assessment and decision-making in academic libraries 
and information services. The study includes 
gathering base-line data on the kinds of, and manner 
in which, decisions are currently made in academic 
libraries. 

Conclusions will be of value to managers of 
libraries and information centers, as well as to the 
administrators of the institutions in which they 
function. 


The results, which will detail indicators essential 
to making resource allocation and acquisition 
decisions, will be particularly timely, considering 
the rising costs and declining budgets being faced 
by libraries and information centers. 

Findings will be disseminated through library and 
information science journals, as well as through the 
science and engineering information literature. 


#027 

Designing Information Systems for More Effective Use in 
Decision Making. University of Colorado, Institute of Be- 
havioral Science, Boulder, CO 80309. Kenneth R. Hammond, 
Richard L. Cook, and Robert Taylor. $105,531 for 12 
months beginning May 2, 1979. 


The hypothesis that cognitive aids in judgment 
and decision-making can improve the effectiveness 
of information systems is to be tested in this one- 
year project. The experiment will use two cognitive 
aids programs (POLICY and DIVERSE) linked to 
computerized data bases in a controlled manner 
that permits the searcher/user to cope with the 
complex task of human information and decision 
making processes. Results of this are to be compared 
with information searching and decision-making 
using {1} conventional manual methods, and (2) a 
computerized information system Sut without judg- 
ment and decision aids. Performance of the three 
groups ts to be assessed on the basis of five criteria 
for evaluating effectiveness retraceability, reliability, 
interpersonal conflict reduction, and user satisfaction. 


#028 

Information Correlates of Basic Language Structures. Columbia 
University, Center for the Social Sciences, 420 West 118th 
Street, New York, NY 10027. Zellig S. Harris. $74,989 for 
24 months beginning September 23, 1980. 


This project is part of an ongoing effort to 
determine the informational structure of language. 
The research is aimed at reducing the information 
expressed by word-structure (morphology) to the 
already established information expressed by words 
combining into a sentence (syntax). Preliminary 
investigations suggest that this can be done, and 
have developed methods for doing it. 

The whole work rests on a method of analyzing 
the structure and information of language which 
has shown that two processes—distinguished by 
objective and computable criteria—go into the 
construction of every sentence: one a direct infor- 
mational processing of combining words, and the 
other a computable process of reducing low- 
information words which does not change the 
meaning of the sentence. The present project aims 
to extend this analysis to the internal structure of 
words. 


Such an extension would show that morphology 
contains no information rot contained in syntax. 
Hence it would show that the known process of 
combining free simple words into sentences carries 
all the information carryable by language. It would 
also give the starting point for computer programs 
to replace complex words by constructions of simple 
words, in any computer processing of written 
information. 


need to be inferred, and reasoning required to 
generate appropriate responses to queries. 

The purpose of this research is to combine the 
data and document retrieval technologies in order 
to achieve a single system capable of the exact 
matching required by business applications and the 
approximate matching needed in document retrieval 
and question-answering. Such a combined system 
would narrow the artificial separations which now 


exist between the different aspects of the retrieval 
problem, and create a more flexible environment, 
responsive to user needs. 


#029 


Normal Sentence Forms and Information Structure. Columbia 
University, Center for the Social Sciences, Box 20, Low #03) 


ial Library, N ‘ork, NY 27. Charles D. 
Memorial Library New York, vo 10027. Charles D Matt ical Modelline in A ic Inf ion Retrieval. 
Parsons and Isaac Levi. $354,377 for 24 months beginning ' “one . , 
September 6. 1979 Cornell University, Department of Computer Science, Ithaca. 
P , NY 14853. Gerard Salton. $145,040 for 24 months beginning 
This proposal will undertake a multi-faceted, May 3, 1979. 


interdisciplinary investigation of the structure of 
information within a framework provided by a 
theory of language which was developed by Projessor 
Zellig Harris at the University of Pennsylvania. The 
research, which involves investigators from Columbia 
University and the University of Pennsylvania, will 
utilize the methods of distributional and combina- 
torial analysis, statistics, mathematical logic, and 
the philosophy of science. 

Tite project will seek to determine what categories 
and relations of categories express information, how 
subsidiary information is connected to main infor- 
mation, how change of information is housed in 
information structures, how overt information can 
also carry covert information, and how information 
can be checked for its truth-conditions. The various 
investigations will be unified in that they will all 
Start with the same representation of the way 
language carries information, and all will seek 
structures of information that go beyond the struc- 
ture of language. 


The purpose of this research is to improve under- 
standing of information retrieval processes, hence 
to improved retrieval systems. This proposal covers a 
number of tasks aimed at reconciling ihe results 
obtained by using different retrieval theories, and 
at explaining a number of open problems in the 
area. The aim is to derive a unified retrieval theory 
capable of representing classes of retrieval operations 
and applicable to many retrieval environmencs. 

Two retrieval theories which will be investigated 
by this project are based on vector processing 
considerations and probabilistic methodologies 
respectively. The two theories produce apparently 
different prescriptions for computing the value of 
a term for indexing purposes. These differences will 
be examined and resolved if possible. Another class 
of questions involving relevance indications related to 
the terms matching characteristics between queries 
and records for purposes of interactive retrieval will 
also be investigated. 


#32 
#030 Fast Computational Processes in Information Retrieval. Cornel! 
Research on Combining Document and Data Retrieval Tech- University, Department of Computer Science, Ithaca, NY 
nologies. Cornell University, Department of Computer Science, 14853. Gerard Salton. $29,605 for 12 months beginning 
Ithaca, NY 148580. Gerard Salton. $133,555 for 24 months July 27, 1979. 


beginning November 18, 1978. In recent years various approaches, including both 


There are currently three rather different ap- 
proaches to the problem of information retrieval: 
first, the structured data base system used in business 
processing where a small number of attributes are 
used to achieve exact match retrieval: second, the 
reference retrieval or document processing systems 
where a large number of attributes are necessary 
and only partial matches can be achieved which 
result in lowered precision and increased recall; third, 
the emerging question-answering systems where the 
file is compleiely open-ended and new facts may 


hardware and software, have been introduced for 
use in information retrieval systems. Among the 
problems that have attracted special attention are 
those which are concerned with matching query 
statements with document content. In general, the 
efforts were directed toward achieving “perfect 
match” retrieval where records whose keyword 
assignment exactly matches a Boolean combination 
of terms included in a query. 

This research will examine the efficiency and 
effectiveness of information retrieval applications 


using a hybrid system consisting of a fast compu- 
tational array processor and a conventional general 
purpose computer. The associative processor ts used 
for rapid computation and comparison of query 
statements with document contents for “closest 
match™ as opposed to exact match retrieval. Within 
this system corfiguration, the computationally 
intensive tasks are executed on the fast, parallel 
floating point processor, data storage is done with 
the usual random access devices, while the conven- 
tional text processing Operations and other control 
operations are executed on the general purpose 
computer. The project will also study the impacts 
of such systems on information processing cost and 
speed, as well as on the design of more flexible 
information retrieval systems. 


routinely acquired, stored, catalogued, or processed 
within the existing library framework, and access 
to them is currently so complex that it has become 
a field of study itself. 

One solution of growing interest 1s the establish- 
ment of a centralized national-level program to 
develop and implement standards for information 
services. In order to address the questions this raises 
and to sort out the many intellectual and technical 
tasks tt would entail, a three-day conference will 
be held in the Spring of 1978. The conference will 
bring together representatives of some 30 major social 
science, information science, and documentation 
organizations and will include about SO particspants 
all told. The conference will be guided by an advisory 
committee and will be preceded by a small planning 
confevence among key participants. Issues relating 


#033 to the development of standard cataloging proce- 


ANSI C ittee Z39. Council of National Library Asso- dures, the operational components of a central 
pei 461 West “poration Avenue Haverford. PA 19041 clearinghouse, and the products and services that 


James L. Wood and Robert W. Frase. $20,000 for 12 months would result from a coordinated effort will constitute 
beginning May 10, 1979 | —_ . the core agenda. The conference proceedings will 


Committee Z39 of the American National 
Standards Institute (ANSI) develops standards 
relevant to information systems, library activities, 
document delivery, information dissemination, 
and information and data handling in all media. 
Based on findings made by a task force sponsored 
by the Foundation and the National Commission 
on Libraries and Information Science (NCLIS), the 
scope of 239 was expanded, its program strength- 
ened, and the management of 239 reorganized. Since 
1965, when the Foundation started supporting 239, 
28 standards have been developed by 239. In addi- 
tion, 739, as ANSIs official representation to the 
International Organization for Standardization 
(ISO), has advanced the United States’ interests in 
the information activities to [SO's decision-making 
meetings. 

During the grant period, 739 will start work on 
reaffirming. revising or withdrawing 8 standards, 
and on developing “‘andards for 11 areas which 
have been selected by the information community. 


#034 


Conference on a National-L evel Program of Access to Machine- 
Readable Data Files. Data Use & Access Laboratories, Inc., 
Arlington, VA 22209. John C. Beresford. $68,300 for 12 
months beginning December 12, 1977. 


Publicly available statistics in the form of 
machine-readable data files contain much demo- 
graphic, economic, and social information needed 
for the accomplishment of social science research 
and for a better understanding of the problems 
confronted by those who work in the public interest. 
Unfortunately, machine-readable data files are not 


21 


be widely announced and disseminated to profes- 
sionals and potential users in both the research and 
the public administration communities. 


#035 


Natural Language Processing Systems and IIl-Formed Input. 
University of Delaware, Department of Computer and Infor- 


mation Sciences, Newark, DE 19711. Ralph M. Weischedel. 
$136,681 for 24 months beginning August 4. 1980. 


Although natural language interfaces to databases 
have improved markedly in recent years, they have 
only begun to consider ill-formed input which ts 
one of the major obstacles to truly natural language 
systems. Quite often natural language queries will 
be ill-formed in the sense of being misspelled. 
ungrammatical, or not entirely meaningful. A re- 
quirement for any successful natural language 
interface must be that the system either guesses at 
a user's intent, requests direct clarification, or at 
the very least, identifies the nature of the ill- 
formedness in an intelligent way. In this project 
the researchers will investigate the hypothesis that 
ill-formedness is rule-based. Violation of the rules 
of normal processing will be used to signal ill- 
formedness. Meta-rules that modify the rules of 
normal processing will then be used for error 
identification and recovery. 

It is anticipated that this approach will ease system 
development and processing. First, because it will 
be possible to design the normative processing system 
independent of the error identification and recovery 
methods. Second, not invoking ill-formedness 
processing until normal processing fails, avoids 
unnecessary run-time costs for well-formed sentences 
which are the usual type of input. And third, 


Gescribing ill-formedness through meta-rules that 
relate to normative rules will avoid duplication of 
aspects of normative processing and allcw general 
Statements covering classes of ill-formedness. 

This project will be performed in collaboration 
with Norman K. Sondheimci, whose partic’pation 
will be supported by Sperry Univac, Blue Bell, 
Pennsylvania. 


computer networks, discipline-based networks, 
and special services such as systems for bibliographic 
search and retrieval. Site visits will be made to the 
organizations, and interviews will be conducted with 
managers and with users. Survey data will be col- 
lected and analyzed. 

The objectives of this research are: {1} to develop a 
clearer understanding of resource-sharing and the 


#036 the users, and the communication processes; (2) to 


The Design snd Assessment of Individuziized Instruction in determine what effect computer resource-sharing 
Bidli hic Data Base A Drexel University, Grad- has on the efficiency and effectiveness of research 
uate School of Library Science, Philadelphia, PA 19104. usir-g it; and (3) to provide guidance for organizations 
Charles T. Meadow. $229 “00 for 24 months beginning concerned with resource-sharing at the policy-making 
February 13. 1978. and operational levels. 


The Drexel University Graduate School of Library #038 
Science and the Franklin Institute Research Lab- 
oratories, are continuinz, in this joint effort, initiated 
under S1S76-09737, to design and evaluate a system 
providing individualized :nstruction in accessing and 


Electronic Information Exchange in Microcomputer Component 
Standardization Research. Electronic Industries Association. 
2001 Eye Strcet, NW, Washington, DC 20006. John F. 


way it 1s influenced by characteristics of the resources, 
| 
| 
| 


retrieving computer-stored information files. The 
evaluated computer-based aids expected as products 
of their research should provide a wide range of 
information users with alternatives to querying 
systems through information intermediaries and or 
to learning to use systems in training sessions not 
designed for their needs. 

Products of the |12-month initial phase include a 
working mode. of the instruction system, and initial 
evaluation of its feasibility, cost effectiveness, and 
transportability to a variety of computer environ- 
ments. This phase will concentrate on further design 
and assessment of the sub-component of the system 
devoted to diagnostic and assistance programs which 
monitor and provide feedback to users attempting 
to improve their search abilities. In addition to a 
more aroplified working model of the systems, second 
phase results are expected to provide input into the 
design of future storage and retrieval systcms. The 
long-range goal to which this project will contribute ts 
the availability of systems with built-in capacity to 
assist users in forming their information requests, 
and in locating and retrieving information relevant 
to their needs. 


The Joint Electronic Device Engineering Courcil 
(JEDE), a standing committee of the Electronic 
industries Association, will explore the use of 
electronic information exchange for selected aspects 
of its work on hardware and software standardization 
of microcomputer large scale integration (LSI) 
products. Via the NSF-sponsored electronic infor- 
mation exchange (EIES), JEDEC will structure and 
moderate standardization discussions among in- 
dustry representatives drawn from regular stand- 
ardization committees and task groups of JEDEC. 
Observers of this process, also via EIES, will include 
representatives of such other interested groups as 
computer scientists, computer hobbyists, professional 
societies, trade associations, the U.S. government 
{mintary). and the trade press. Electronic information 
exchange will be compared with conventional means 
of conducting JEDEC standardization activities with 
respect to: amount and effectiveness of communi- 
cation, elapsed time in processing standardization 
decisions, ability to resolve standardization issues, 
and cost-effectiveness in financial and other respects 


Hessman. $77,113 for 18 months beginning July 24, 1978. 


#39 


#037 Development of Theory and Techniques for Automated know!- 
Factors Influencing Network Sharing of Computer Resources in edge Utilization, Transfer and Extension. ( niversity of Florida. 
Research. EDLCOM., P. O. Box 364, Princeton, NJ 08540. Gainesvilie, FL 32611. Julius T. Tou. $166,740 for 24 months 
James C. Emery. $362,269 for 20 months beginning veginning August 3, 1979. 


September 28, 1978. This research project is aimed at exploring the 


Several organized experiments in resource sharing 
by computer networks are underway. This project 
will study the operation of ten resource sharing 
organizations to analyze the impact of their services 
on the use of network information services by 
research scientists. The study will include regional 
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design concepts and techniques for automated 
knowledge utilization, transfer and extension. During 
the initial stages, efforts will be concentrated on 
the development of theories and methods for tie 
representation and organization of knowledge and 
for knowledge-based design and growth. This in- 


vestigation will be followed by the study of the 
requirements and proce “ures for on-line knowledge 
transfer, interactive browsing, and selection tech- 
niques for knowledge transfer. The total project is 
slated fur two years and ts expected to contribute 
to our understanding of the fundamentals of know}- 
edge processing. 


F040 


‘ egradation of Accuracy in Social Network Data. l niversity of 
Florida, Gainesville, FL 32611. H. Russell Bernard. $19,402 
for 12 months beginning December 18. 1979. 


Scientists desire to improve the methodology of 
information science research, particularly the 
methodology for identifying and studying infor- 
mation networks. They also desire to develop 
measures of rehability and validity for network daia. 
This project will assess the nature and degree of 
human error in data gathered by sociometric tech- 
miques from information networks. It will also assess 
the quality of the data that are collected by providing 
a comparison of sociometric survey data witn 
electronically recorded logs of social interactions. 
The research wil! be carried out with the cooperation 
of volunteers from research groups now using the 
electronic information exchange system at the New 
Jersey Insitute of Technology. Started at the West 
Virginia University, this research will be completed at 
the University of Florida where the Principal In- 
vestigator is now located. 


#4) 


Multiple Attribute Retrieval Techniques for Tree-Structured 
Information. Florida. State University, School of Arts and 
Sciences, Tallahassee, FL 32306. Caroline M. Eastman. 
$24.063 for 12 months beginning August 7, 1980. 


Search algorithms based upon simultaneous use 
of several attributes of the data will be investigated. 
Such “multikey” searches are used in many types 
of information retrieval systems, including both 
document retrieval systems and database man- 
agement systems. 

The various multikey search algorithms, func- 
tioning and proposed, can be classified by the type(s) 
of query handled, by the evaluation metric for 
responses and on the basis of the underlying file 
structure or database. The focus of this research is 
on tree-structured algorithms. Internal nodes of a 
tree contain information to guide the search by 
indicating how well the records in their respective 
subtrees match the query. 

Analytic approaches have typically made restric- 
tive assumptions while experimentation has been 
unwieldy. The proposed work seeks a different 
course, posing a complex model of a class of search 
algorithms and analyzing its performance with 
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artificially generated distributions (involving so-called 
K -d trees} as well as some complex real data. 


#042 

A Study of the Potential Influence of Social, Economic, 
Regulatory and Technological Factors on Scientific and 
“echnical Communication Through 2000 A.D. Forecasting 
International, Lid., 1001 North Highland Street, Arlington, 
VA 22201. Norman Nisenoff. $178,775 for 24 months 
beginning September 13, 1978. 


The purpose of the project is tc improve ouz 
understanding of the interaction of technological 
and non-technological factors influencing future 
changes in the scientific and technicai information 
{ST1) transfer process. This research will examine 
the influence of social, economic, regulatory and 
technological events upon established trends in the 
processing and distribution of STI, and predict th 
consequences of such trends and pos:ible future 
events. The trends studied will related to both 
capabilities and needs within predicted environments 
for the year 1980, 1990 and 2000, corresponding tc 
the occurrence or non-occurrence of selected subsets 
of events. 

Descriptions of alternative futures will be devel- 
oped by the application of trend analysis and cross- 
impact analysis using existing data bases as a nucleus. 
This is to be augmented by individual as well as 
collective judgment during a workshop session 
involving expert consultants. 


#043 


Support for Coordination of Scientific and Technical Infor- 
mation Activities. Franklin Institute Research Laboratories, 
Philadelphia, PA 19103. Alec Peters. $50,000 for 7 months 
beginning December 4, 1979. 


The purpose of this project is to provide technical, 
consultative, analytic and study support services 
to the Foundation’s Division of Information Science 
and Technology (IST) in its conduct of coordination 
activities at the national and international levels. 

The contractor shall perform the following tasks 
as shall be further specified by IST: {1} Logistical 
and administrative arrangements for conferences 
on topics to be designated by IST; and (2) Prepara- 
tion of analytical papers on designated topics which 
focus on special issues relevant for coordination of 
national and international information activities. 

This continues work started under NSF Award 
DS177-14816. 


#044 

The Effects of ST] Changes on the Engineering Profession. 
The Futures Group, \24 Hebron Avenue, Glastonbury, CT 
06033. Hedvah L. Shuchman. $129,927 for 15 months 
beginning June !3, 1978. 


This study is designed to examine the effects of 
changes in scientific and technical information 
transfer on the engineering profession. The objectives 
of the study: are to identify the information used 
by technologists of various disciplines and special- 
ties; identify the major modes of communication and 
patterns of use; describe changes which may be 
anticipated in information use patterns and com- 
munication modes as a result of changes in infor- 
mation storage and dissemination methods; and 
identify policy implications resulting from these 
evolutionary changes. 

By means of a large scale survey a broad spectrum 
of engineers will be asked to provide :nformation 
on their use of S&T information. Data collectec 
will be analyzed to provide information profile and 
use patterns of engineers, and to project changes 
of the profile and pattern resulting from advances 
in information processing technologies. This research, 
which will cover technologists in industry, govern- 
ment and academic research, will be of value to 
informatio: service managers and policy makers. 


#045 


Syntactic Structure of Information and Information Processes. 
Georgia Institute of Technology, Atlanta, GA 30331. Viadimir 
Slamecka. $137,790 for 24 months beginning August |, 1979. 


This research is intended to increase fundamental 
understanding of the syntactic structure of infor- 
mation and information processes. The purpose is 
to gain a better understanding of human processing 
of sign shapes (words) as measured by information 
transfer rates and to determine the redundancy curve 
for natural language. This research will rely on the 
ediometer, an instrument developed at the Georgia 
Institute of Technology for measuring the deviance of 
artificial words from the norm. 


#046 


Study of Potential New Applications of Information Theory 
to Problems of Information Science. Georgia Institute of 
Technology, School of Information and Computer Science, 
Atlanta, GA 30332. Pranas Zunde. $82,449 for 12 months 
beginning September 10, 1979. 


The purpose of this study ts to: (1) critically 
evaluate what impact information theory has had 
on the broad domain of information science: (2) 
identify problem areas of iniormation science to 
which information theory is potentially applicable; 
and (3) explore promising extensions of theory 
towards broader-based foundations of informa- 
tion science. This is to be achieved by an in-depth 
study of the first two items by the investigating team, 
followed by a workshop of a small group of recog- 
nized experts who will concentrate their deliberations 
principally on the third item. 
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#047 

Verification of Syllable Hypotheses in Automatic Phonetic 
Analysis of Spoken English. Haskins Laboratories, Inc. New 
Haven, CT 06510. Franklin S. Cooper. $109,475 for 12 
months beginning August 30, 1979. 


Automatic phonetic analysis of fluent speech is 
an essential component of computer speech- 
processing systems. Dealing successfully with fluent 
speech requires that the system make use of knowl- 
edge about the languazc (top-down processing} as 
weil as parametric representation of the speech 
waveforms (bottom-up processing}. At an inter- 
mediate level, the parametric representation is seg- 
mented and categorized into a sequence of symbols 
and words, resulting in a drastic reduction in the 
informztion rate. This research ts focussed on the 
problem 0} parameter-to-symbol transformations. 


#048 


Research on Automated Natural Language Generation. Hofstra 
University, Comparative Literature and Languages, 100 
Fulton Avenue, Hempstead, NY 11550. Karen J. Neff. 
$110,633 for 24 months beginning August 7, 1980. 


Recent years have witnessed considerable research 
devoted to natural language processing by computer. 
This progress has been substantiated. But this work 
has tended to con: «<*='rate on machine analysis, 
understanding, and mference patterns and has largely 
ignored the question o/ how to produce a variety 
of intelligent ouiputs. There are no known computer 
programs that will accept an input topic and freely 
generate a basic informative paragraph on that topic. 

This project will: (a) describe in formal terms 
the structured properties of basic non-narrative 
paragraphs, (6b) investigate the connections between 
the syntactic form of such paragraphs and their 
semantic content, or the type of information they 
convey; (c) examine the proper use of stylistic 
variation in sentences and paragraphs by introducing 
such variation into a paragraph-generating computer 
program; end (d) show how paragraph patterns of 
informatic# :n text can be algorithmically identified. 


#049 
Video Full Text Processing. //7 Research Institute, \0 West 


for 24 months beginning August 23, 1978. 


Problems of using video technology for full-text 
delivery are being explored through simulation of 
a video text system in which the output will be 
displayed on a standard television screen. This 
research project will consider: (1) methods for 
overcoming the low resolving power of the standard 
television screen through electronic reformatting 
techniques; (2) human factors anai.sis of a video 


text system; (3) trade-offs of efficiency among 
electronic text distribution, hybrid systems, and 
major alternatives. These factors will be studied using 
available standard television and computer equip- 


potential roles of the librarians and information 
specialists, the requirements for the library profession 
and library services, and other problems affecting 
the various areas of the library. 


ment to manipulate video images of text and to 
display them to selected user groups. Reading 
efficiency performance parameters for a high con- 
trolled user's group will be studied, using the sh=red 
task paradigm to identify optimal system design 
parameters. Later, using the optimal parameters, 
use Statistics will be studied for several subject 
populations to whom the system will be made 


#052 


Theoretical Issues in Natural Language Processing-2: An 
Interdisciplinary Conference. University of Illinois, Coordinated 
Science Laboratory, Urbana, IL 61801. David L. Waltz. 
$14,242 for 6 months beginning June 26, 1978. 


This award is to support an aterdisciplinary 


available. conference to be held July 25-27 at the University 
of Illinois at Urbana-Champaign. It will bring 
#050 together researchers interested in using computer- 


languages to represent and process the texts of 
documents prepared in natural languages. Results 
will be applicable to the design and development 
of advanced information storage and retrieval 


An Augmented English Dictionary for Information Retrieval. 
11T Research Institute, 10 West 35th Street, Chicago, IL 
60616. David S. Becker. $83,285 for 12 months beginning 
June 26, 1980. 


This proposal addresses problems of information 
retrieval systems, including ambiguous words, 
incomplete or inconsistent indexing, and terminology 
not recognized or understood by the searcher and 
others. 

The project aims to study and develop algorithms 
and computer programs which automatically dis- 
ambiguate and index terms, thereby providing an 
augmented English dictionary for use in information 
retrieval. Specific objectives include: study and 
develop methods for disambiguating and indexing 
terms, based on keyword methods, conditional 
probability, and methods from artificial intelligence, 
test and measure the reliability of such approaches: 
characterize the structure of the dictionary and the 
nature and import of its operations and failure modes 
if any. 


systems which (1) allow utilization without special- 
ized knowledge of computer languages; and (2) 
provide more assistance to users in accessing and 
retrieving required information. 

The conference is expected to attract about three 
hundred researchers in linguistics, computer science, 
information science, and cognitive and social psy- 
chology. The conference will have six sessions each 
consisting of 5-6 invited papers from outstanding 
researchers along with one or two papers selected 
from those submitted to the conference. 

The collection of papers to be produced will be 
widely circulated and is expected to familiarize both 
interested researchers and others with current ideas 
and paradigms in storing and processing natural 
language text. 


#053 
Storage and Access Costs in Data Structures for Information 


#05 | 

Impact of a Paperless Society on the Research Library of the 
Future. University of Illinois, Library Research Center, 
Urbana, IL 61801. F. W. Lancaster. $50,540 for 12 months 
beginning June 11, 1978. 


Retrieval Problems. University of Illinois, Coordinated 
Science Laboratory, 1101 West Springfield, Urbana, IL 
61801. Donna J. Brown. $62,663 for 24 months beginning 
August 14, 1980. 


The purpose of the project is to investigate the 
implications of paperless communication systems 
for the research library. The study essentially involves 
two facets: (1) a projection of the impact of present 
and future technology on the production and dis- 
tribution of primary and secondary literature: and 
(2) an understanding of the changing roles and 
functions of research libraries in this evolutionary 
environment. Several alternative scenarios of library 
activities up to the year 2000 will be depicted 
assuming differing levels of automation. Policy and 
operation issues to be explored and investigated 
include possible interrelationships among future 
libraries and between libraries and publishers, the 
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It is proposed to study efficient storage and 
accessing of data in information retrieval systems. 
Two objectives of information retrieval systems 
are {i) to store data as compactly as possible and 
(ii) to be able to quickly retrieve information and 
perform desired updates. There are, however, 
tradeoffs between the efficient storage and access- 
ing of data, and these tradeoffs are dependent on 
the precise application. One phase of the proposed 
work is to study particular data structures and the 
Storage-access cost tradeoffs inherent in performing 
certain operations. A second phase of the work will 
be development of analysis techniques which can 
be applied to various retrieval problems, using cost 
measures appropriate to the particular applications. 


Preliminary work has been directed toward ob- 
taining lower bounds, indicating inherent limitations 
in storage and retrieval of information. What is 
particularly encouraging is that these lower bounds 
are essentially tight for the implementation schemes 
considered. Thus, deternzining lower bounds has 
led to “optimal” retrieval methods. It is proposed 
to continue this work, considering more complicated 
data representation schemes and more complicated 
cost measures. 


and terminology of other systems. These conventors 
or translators which make systems appear to be 
alike to the searcher can be called transparency aids. 
An integrated collection of transparency aids then 
would comprise a transparent system. This project 
will: {1} design a generalized model for a transparent 
system, (2) analyze the alternative factors affecting 
the transparency interface, (3) develop a skeletal 
user-oriented terminal for accessing the transparency 
system, and (4) integrate the transparency subsys- 
tems. The alternatives study will show advantages 


and disadvantages of various model confignrations, 
and the user-oriented terminal developmen. together 
with the subsystem integration will permit evaluation 
of the various subsystems by showing how well they 
interact to perform their functions. 


#054 


Modeling of Human Behavior in Seeking and Generating 
Information. University of Illinois, Coordinated Science 
Laboratory, College of Engineering, Urbana, IL 61801. 
William B. Rouse. $144,120 for 24 months beginning 
December 21, 1979. 


The goal of this research is to develop analytical 
techniques that are applicable to the design of 
computer systems that understand their users and 
can adapt the human-computer interface so as to 
best meet the needs of the particular human using 
the system at the moment. 

Development of mathematical models of human 
behavior in information generation and retrieval 
will be undertaken. The specific tasks to be studied 
include text editing and searching/browsing through 
data bases. The models will be based on Markov 
process theory, and will be formed by analyzing 
the human's keyboard entries. The resulting models 
will be useful for various applications, especially in 
the area of training. 


#056 

Automatic Duplicate Detection of Journal Articles Appearing in 
Multiple Databases—A Feasibility Study. University of Illinois, 
Information Retrieval Research Laboratory, 291 Coordinated 
Science Laboratory, Urbana, IL 61801. Martha E. Williams. 
$269,303 for 24 months beginning February 13, 1980. 


The purpose of the study is to develop a meth- 
odology for automatically detecting overlap among 
database records found in multiple databases. There 
are currently some 71 million bibliographic records in 
computer-readable form. The problem of overlap 
among them represents duplicative efforts in database 
generation; it represents duplicative efforts in storing 
and processing records for researchers who search 
multiple databases for a particular query. Thus, 
duplication represents a real cost to producers, 


processors, and users of databases. 

The project will develop a methodology for 
automatically determining overlap among databases. 
It will test and implement this methodology on seven 
databases: AGRICOLA, BIOSIS, Previews, CACon, 


#055 

An Integrated Man/Machine Interface to Facilitate Network 
Resource Utilization. University of Illinois, Urbana, IL 61801. 
Martha E. Williams. $174,432 for 18 months beginning 
February 14, 1978. 


The purpose of this research is to investigate the 
complexity of factors which are barriers to the use 
of machine-readable files of the scientific and 
technical literature. The majority of the world’s 
currently published abstracting and indexing lit- 
erature is in computer-readable form. Unfortunately 
there are no standards governing the format of data 
bases, the elements and vocabularies included in 
them, or the on-line systems for searching them. 
The lack of standards imposes requirements on 
searchers to become familiar with the variety of 
exis(ing data bases, vocabularies, access protocols, 
system features and command languages as well as 
to keep up-to-date with changes in all of them. 

Many of these differences could be made less 
apparent by developing translators for converting 
the procedures or conventions or 'erminology of 
one system into equivalent procedures, conventions, 
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Excerpta Medica, Medline, Science Citation Index 
{SCI}, and Social Science Citation Index (SSC]I). 
The proposed methodology involves analysis of the 
databases to determine common data elements in 
order to construct effective keys for matching records 
in multiple files. The study will produce overlap 
Statistics among the seven databases, data element 
frequency statistics, and a methodology for duplicate 
detection that might be used by database producers 
or database vendors. It could serve as a preprocessing 
Step to reduce storage and manipulation costs or 
as a post retrieval step to eliminate the duplicative 
output delivered to users. 


#057 


Access Time Versus Storage Space in Information Retrieval 
Systems. University of Illinois, Urbana, IL 61801. Swomitri 
Swamy. $59,829 for 24 months beginning June 26, 1980. 


It is proposed to study the performance of 
algorithms that retrieve information from a direct 
access file system. Specifically, the aim of the work 
is to derive results on the tradeoff between access 
time and storage space. 

Because there are many types of query (e.g., exact 
match, partial match, range query) and many types of 
algorithms (based on, e.g., probability theory, 
queuing theory, combinatorics), it is unrealistic to 
expect to find a theory universally applicable to 
these disparate search problems and algorithms. 
Therefore to render the problem manageable the 
class of retrieval systems will be narrowed {i} by 
restricting the kinds of information that is stored 
in the records and {ii} by considering searching 
problems where the query concern some mathe- 
matical domain. The first restriction will constrain 
the kinds of answers that a system may provide, 
while the second will make possible use of the 
mathematical relationships in achieving the goal 
of relating parameters of storage space and access 
time. 

The results will take the form of nontrivial lower 
and upper bounds for access time as a function of 
Storage space. The upper bounds will involve the 
construction of specific retrieval schemes that achieve 
those bounds. 


#058 


Siructuring the Lexicon and the Thesaurus with Lexical- 
Semantic Relations. ///inois Institute of Technology, Computer 
Science Department, 3300 South Federal Street, Chicago, 
IL 60616. Martha W. Evens. $43,575 for 24 months begin- 
ning August 6, 1979. 


The development of hierarchical structures, such as 
indexes and abstracts, to enable ease of access to 
large stores of information is a major concern of 
information science. The advent of computerized 
information processing systems has made it possible 
to generate some of these tools, such as indexes, 
automatically. But the computer also offers the 
potential for much more powerful information 
retrieval such as question-answering systems and 
other models of natural language processing. If this 
potential is to be realized, more powerful processing 
tools which express more and deeper relationships 
among words than synonymy and autonomy must 
be made explicit for the machine. The question is: 
Can these lexical-semantic relations be identified 
automatically? 

The first step in this research will be to identify 
new lexical-semantic relationships and to derive their 
logical properties. The second step will be to extract 
word-relation-word triples from dictionaries. The 
target will be the expanded system of relations 
developed in Step |. The third step is to design a 
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thesaurus using these relations and conduct a pilot 
study of its effectiveness in enhancing recall. 


#059 

The Economics of Libsary Subscriptions: A Sample Survey. 
Indiana University, Bloomington, IN 47401. Herbert S. White. 
$61,025 for 12 months beginning June 13, 1978. 


The objective of this project is to examine the 
impact of periodical availability in libraries on the 
nlacement and cancellation of subscriptions. De- 
creasing subscriptions by individuals, the impact 
of inflation on library budgets, and the growth of 
networks and consortia are believed by many to 
have seriously eroded publisher income. The re- 
seachers will develop information and data on the 
role played by each of the above factors on sub- 
scription levels. The results will lead to a better 
understanding of the magnitude of today’s problems, 
will assist managers of information services in their 
planning, and will provide policy makers with hard 
data and information on which to base their 
recommendations. 


#060 

Problems in Decision-Making and Information Retrieval 
Studied with Techniques of Fuzzy Sets and Systems. /ndiana 
University, Department of Mathematics, P. O. Box F, 
Bloomington, IN 47401. Madan L. Puri. $112,690 for 36 
months beginning December 5, 1979. 


The objective of this project is to develop and 
use techniques and results of fuzzy sets and systems 
theory to provide realistic solutions of problems in 
(1) decision-making under uncertainty; (2) presence 
of imprecision in information retrieval systeras; and 
(3) subjective evaluation of random phenomena in 
information retrieval systems. Models of information 
retrieval systems incorporating the concept of fuzzy 
sets and systems will be developed which can be 
used to facilitate accurate decisions even if the input 
information (data) is only approximately known. 
In many areas of information science, such as pattern 
recognition, decision-making, information retrieval, 
and management information systems, among 
others, human judgment is often imprecise and the 
decision-maker manipulates not with numbers but 
with vague concepts. These vague concepts are 
technically known as “fuzzy sets.’ By studying 
information retrieval systems with a continuum of 
information, new models for large information 
systems can be built which will take into account 
approximate grouping of the information items and 
the possibility of measuring the subjective infor- 
mation contained in these groups. Eight specific 
decision-making and information retrieval prob- 
lems and their solutions in terms of the concept of 
fuzzy sets and systems are outlined as specific 
research tasks 


#061 in studies of the social system of science and science 


Electronic Information Exchange in Research on Devices for policy; and (3) identifying areas of information 
the Disabled. Innovative Systems Research, Inc., 103 Cooper science research that would enhance applications 
River Plaza East, Pennsauken, NJ 08109. Jane McCarroll. in research on the scientific enterprise and science 
$20,275 for 12 months beginning June 6, 1979. policy by improving such indicators. 


The objective of this project is to conduct a #064 
planning meeting and pilot test of the use of elec- 
tronic information exchange in the development of 
an improved nationwide delivery system for reha- 
bilitation technology products, services, and 
information. 

The project has been funded, as an addition to 


User Requirements for Scientific Reviews. /nstitute for Scientific 
Information, 325 Chestnut Street, Philadelphia, PA 19106. 
Susan E. Cozzens. $31,421 for 12 months beginning Novem- 
ber 7, 1978. 


Increasing the use of scientific reviews has been 


DSI77-17924, by the Division of Information Science 
and Technology with funds provided by the Foun- 
dation’s Directorate for Applied Science and 
Research Applications (ASRA), as part of the 
program for “Science and Technology to Aid the 
Physically Handicapped.” 


proposed as a solution to the problems of the 
“Information Explosion,” because reviews evaluate 
and synthesize research results and present them in 
a condensed form. One of the important functions 
served by review articles is for current awareness 
outside of one’s own speciality. Therefore, the 
efficient use of reviews serves to cross-fertilize ideas 


#062 and innovations in science. 


The Structure and Devel ¢ of the Information Science This study will examine the relationship between 
Literature: A Citation Study. Institute for Scientific Infor- Saas & S variety of science-related roles and the 
mation, Philadelphia, PA 19101. Henry G. Small and Gabriel characteristics of the reviews they prefer to use. It 
Pinski. $32,689 for 12 months beginning August 23, 1978. is posited that level of research activity will be related 


A bibliometric and citation analysis of the field 
of information science will be carried out which 
will utilize citation and publication data from the 
Social Sciences Citation Index (SSCI). A nine-year 
(1969-1977) file of data will be extracted from the 
SSCI and mounted on an on-line system for manip- 
ulation and graphic display of citation networks. 
This on-line system will permit analyses of the 
internal structure and development of the field of 
information science. 

In the second phase of the 12-month study, a 
second data base, a clustering version of a three 
year (1972-1974) SSCI file, will be used to investigate 
connections between information science and other 
areas, and to conduct a comparative study of 
information science and the field of education 
research. 


to the frequency of review-use for current awareness, 
and that the level of codification of the researcher's 
field will affect the extent to which reviews are used 
for bibliographic retrieval. The kind of review desired 
for current awareness will vary, in turn, by the 
frequency of use. Frequent users are expected to 
need more detailed, authoritative, and current 
reviews. 

A questionnaire mailed to 1200 scientists and 
engineers will provide the primary survey instrument. 


#065 


Citation-Based Measures of Interactivity and Similarity. 
Institute for Scientific Information, 325 Chestnut Street, 
Philadelphia, PA 19106. Elliot Noma. $38,188 for 12 months 
beginning March 6, 1980. 


Bibliographic citation patterns contain important 
information on the interactions among publishing 


#063 entities. In order to extract this information from 
the various kinds of publishing data, it is necessary 
to formulate appropriate measures and associated 
statistical methods. In previous and ongoing work 
the principal investigator has developed a measure 
of the flow of influence among publishing entities. 
An important property of this measure is that it is 


Research Needs and Applications for Indicators Based on the 
Scientific and Technical Literature. /nstitute for Scientific 
Information, 325 Chestnut Street, Philadelphia, PA 19106. 
H. Roberts Coward. $18,142 for 6 months beginning 
September I1, 1978. 


This award is for a two-conference project which 
will bring together information scientists, analytical 
methodologists, and science studies researchers for 
the purpose of: (1) assessing the existing body of 
research concerning quantitative aspects of the 
scientific and technical literature; (2) identifying in 
this existing research indicators with applications 
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not distorted by variations in size or by variations 
in the amount of self-citation. 

The proposed work involves the development of 
measures for other important aspects of bibliographic 
systems, specifically the intensity of interaction and 
the similarity of two entities with respect to how 
each relates to various third parties. The similarity 


measure will provide the basis for creation of 
multidimensional maps using the technology of 
multidimensional scaling. These maps will in turn 
be compared, using a cluster similarity measure, 
newly developed by the principal investigator. This 
latter measure provides a tool for further exploration 
and validation of the interaction and similarity 
measures mentioned above. 

The methodology to be developed will be appli- 
cable in a variety of ways. For example, the cluster 
similarity measure would make it possible to trace 
the historical evolution of the bibliographic rela- 
tionships in a discipline. More generally, the statistics 
of cluster resemblance would enhance the usefulness 
of multidimensional scaling for its widespread use 
in bibliometric and psychometric studies. Finally, 
interaction and similarity are relevant not only to 
bibliometric studies but also to indexing and 
classification. 


#066 


Human Requirements in Information Retrieval. Johns Hopkins 
University, Baltimore, MD 21218. Alphonse Chapanis and 
Gerald D. Weeks. $195,841 for 24 months beginning 
February 5, 1979. 


The purpose of this research is to examine the 
human requirements in the use of information 
systems. The principal objectives are to: (a) develop, 
refine and apply research and evaluation method- 
ologies appropriate to making these requirements 
explicit: (b) collect quantitative data on these user- 
system behaviors; and (c) present the finds in a 
form that will be useful to designers in the devel- 
opment of more effective information retrieval 
systems. 


The project involves three major tasks. The first 
will obtain: (a) a realistic estimate of recall and 
precision ratios of documents in different areas of 
science; (b) an estimate of the inter-rater reliability 
of expert judgments of document relevance in 
selected areas of science, and (c) a better apprecia- 
tion of how users assess the “relevancy” of docu- 
ments and the cognitive processes they use in those 
assessments. 

The second task will obtain quantitative data on 
a number of aspects of man-system interactions 
and construct a model of an information retrieval 
system with quantitative statements of its human 
requirements. 

The third task is a controlled experiment com- 
paring the effectiveness of query formulation 
developed between users and information specialists 
who communicate in three modes: interactive tele- 
type, telephone. or face-to-face. 
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#067 

Improving the Use of Computer-Based Information Resources 
Within the Engineering Classroom. University of Kansas, Center 
for Research, Inc., 2291 Irving Hill Drive, Lawrence, KS 
66045. Dale Rummier. $61,421 for 15 months beginning 
January |, 1978. 


This award continues research initiated in 1976 
under DS1I76-08949 to test and evaluate the costs 
and usefulness of using computer-based information 
resources as a teaching tool in a range of engineering 
courses. Materials for teaching access to and use 
of information services have been developed and 
introduced into engineering design courses at the 
University of Kansas. Utilization of these materials 
by students, the cost of using these information 
systems to teach the courses and the effects on 
students and faculty have been monitored and 
assessed. In addition to the report summarizing 
results of the first year, a workshop was held at the 
1977 ASEE Summer School to inform chemical 


engineering faculty about the progress of the study. 

The second year effort will revise materials and 
Strategies developed in light of feedback; expand 
the universe of students; and evaluate longer-term 
effects on participants in 1976-77 courses. Trans- 
portable teaching materials, guidelines and strategies 


for their use, and assessments of the costs and effects 
of using these materials will be the products of the 
second year effort. A workshop is planned for 
summer 1978, as one means of disseminating the 
results of the project. 


#068 


Development of Statistical Indicators of Scientific and Technical 
Communication. King Research, Inc., 6000 Executive 
Boulevard, Rockville, MD 20852. Donald W. King. $9,281, 
November 15, 1977. 


Costs adjustment to a predecessor award described 
below. 


#069 


Data Gathering for Statistical Indicators of Scientific and 
Technical Communication from Primary Sources. King Re- 
search, Inc., 6000 Executive Boulevard, Rockville, MD 2085). 
Donald W. King. $147,442 for 12 months beginning 
November 22, 1977. 


This is the third year of a continuing effort to 
develop statistical indicators of scientific and tech- 
nical communication. This third-year effort will result 
in the development of the first detailed model that 
will allow us to trace the flow and predict the impact 
of changes that are likely to occur in scientific and 
technical communication as a result of changes in, 
for example, copyright legislation, library automation 
and page charge policy. This model will also spe- 
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cifically provide data and analyses on the impact #072 


of Federal funding on scientific and technical Evaluation Measures and Methods for On-Line Bibliographic 
COMMURICAION. Systems. King Research, Inc., 6000 Executive Boulevard, 
Rockville, MD 20852. Nancy K. Roderer and Robert R.V. 
#070 Wiederkehr. $143,846 for 15 months beginning August 14, 1979. 
Systems Analysis of Scientific and Technical Communication in 
the United States. King Research, Inc., 6000 Executive 
Boulevard, Rockville, MD 20852. Donald W. King. $11,394 
for 6 months beginning December 8, 1977. 


On-line bibliographic searching is an important 
relatively recent development in information services. 
A range of evaluation measures can be identified 


The number of scientific and technical articles 
published doubles every eight to ten years. The range 
of possible uses for the information in those articles 
and the number of users are also increasing rapidly. 
Therefore the limits of what can be communicated 
by printing, mailing, storing, and retrieving pieces 
of paper may be at hand. One possibility for im- 
proving communication is to use computer systems 
to capture and transmit new information. This is 
not the only alternative, but it is one which holds 
great promise for the productive use of information. 
The objective of this project is to devise an analytic 
framework which will allow publishers and other 
information distributors to judge the potential of 
the electronic alternative to paper-based com- 
munication in relation to the conventional system. 
This framework is intended for use in comparing 
total communication systems in terms of costs, 
benefits, and other implications defining issues; and 
in specifying the data needed for the resolution of 
those issues. 

This amendment to Contract NSF-C76-15515 
provides for consideration of the impact of the 
revised copyright law on the analysis and on the 
cost model developed as part of the project. 


for on-line bibliographic systems. Some are based 
on measures previously developed for manual 
systems and batch processed systems, but others 
relate to the interactive capabilities available on line. 
The purpose of this project is to review existing 
evaluation measures and to develop new ones for 
on-line bibliographic systems. Both information 
science literature and material in related fields such as 
computer systems, data base management, and 
numeric data bases will be reviewed. 

The evaluation will involve the application of 
cost-benefit analysis ‘o the overall bibliographic 
system, with parallel evaluation at function, product 
and service, and activity levels. Evaluation will be 
viewed from the perspective of all the major biblio- 
graphic system participants, including abstracting 
and indexing (A&I) services, information dis- 
semination centers, libraries, information brokers, 
and users. Both A&I and library cataloging data 
bases will be considered. 

Study tasks include development of an on-line 
system model, analysis of evaluation measures and 
methods, and selected testing of new approaches. 
Output of the study will be a monograph explicating 
evaluation results. 


#073 


#071 Uses of Citing Statements to Improve Retrieval. Lehigh 
Evaluation of Information Science Research. King Research, University, Bethlehem, PA 18015. John J. O'Connor. 
Inc., 6000 Executive Boulevard, Rockville, MD 20852. $116,037 for 24 months beginning July 5, 1978. 

Donald W. King. $30,000 for 6 months beginning Sep- 


C ter-readable full text of scientific journals 1 
tember 30. 1978. omputer-readable full text of scientific journals is 


becoming increasingly available as computer type- 


This six-month award provides for the develop- 
ment of a design to carry out an evaluation of the 
information science research program. The approach 
includes selecting representative information science 
research projects and developing methods for 
assessing their contribution to information science 
and technology. Products of the award will include: 
(1) selection of the projects to be evaluated; (2) de- 
velopment of measures of use and utilization of 
research results; 2nd (3) design of instruments and 
analysis procedures tor implementation of the design. 

Several design studies, in addition to this one, 
are being conducted concurrently. Phase Il awards 
will implement the most promising evaluation designs 
produced. 


setting is more widely used for economic reasons. 
When papers are available in computer-readable 
full text they can be retrieved by text searching. 
Such papers can also be employed to provide useful 
computer-readable information about other papers, 
which are not necessarily themselves in computer- 
readable form. 

Specifically, citing statements within papers 
represent what users have found important in the 
papers cited. If citing statements can be computer- 
identified in computer-readable full text, they can 
then be used to: (1) improve the reference retrieval 
performance of present retrieval services, (2) facilitate 
user processing of retrieval outputs, and (3) increase 
the usefulness of citation indexes. 


This project will investigate procedures for citing 
Statement recognition, and for the use of computer- 
recognized citing statements to improve reference 
retrieval and retrieval output processing. 


#074 


A Theoretical Basis for Knowledge Management. Lehigh 
University, Center for Information & Computer Science, 
Building 8, Bethlehem, PA 18015. Donald J. Hillman. 
$123,551 for 24 months beginning June 19, 1980. 


The purpose of this research is to provide a 
theoretical basis for the development of intelligent 
systems particularly those that involve the man- 
agement of knowledge and inference. Such systems 
will require that the capabilities possessed by existing 
information processing systems be substantially 
expanded. 

The project will construct a first-order theory that 
will relate the common features of knowledge 
management systems to a set of underlying concepts 
and deductive rules. It will then examine the com- 
pleteness of the theory and investigate its implications 
for system design. The first-order predicate calculus 
will be employed, and a relational approach to 
empirical data will be taken. 


#075 

Citing Statements: Recognition by Computer and Use to 
Improve Full-Text Retrieval. Lehigh University, Center for 
Information and Computer Science, Mart Library #8, 
Bethlehem, PA 18015. John J. O'Connor. $115,447 for 24 
months beginning July 22, 1980. 


Computer-readable full text of scientific journals 
is becoming increasingly available as computer 
typesetting is more widely used for economic reasons. 
Large-scale, full-text retrieval services for lawyers 
have existed ‘or a number of years. An advantage 
of such ret’ eval is that the full text of a publication 
contains far more information about itself than do 
its abstract and index terms. It also contains infor- 
mation about other publications, i.e., those it cites. 
If citing statements can be automatically recognized, 
they could provide very powerful access tools that 
could not only identify related documents but also 
link related concepts within documents. 

The investigator will study the following questions: 
(1) Can procedures for computer recognition of citing 
statements developed for one subject field (inorganic 
chemistry) be used effectively in other subject areas?: 
(2) Can the use of citing statements aid retrieval if 
full text is searched rather than index terms and 
abstract text?: (3) How much false retrieval will be 
produced by the use of citing statements to aid 
retrieval? 


#076 


Evaluation of the Use of Slow-Scan Television and Telefacsimile 
in a Library Network. Library of Congress, Federal Library 
Committee, Navy Yard Annex, Room 400, Washington, 
DC 20540. James P. Riley and Joan M. Maier. $145,905 
for 24 months beginning August 23, 1978. 


In a cooperative effort, fifteen Federal and state 
libraries will be experimenting with the use of slow- 
scan television and telefacsimile devices to transmit 
images of document pages over telephone lines for 
the purpose of testing the concept and practicability 
of electronic document delivery in a nationwide 
network environment. Impact on library patrons, 
librarians, and library administrators will be 
assessed. Data on network activities will be col- 
lected and analyzed by an independent contractor. 
In addition to interlibrary loans, the system will 
also be tested for its usefulness for teleconferencing, 
work sessions, training, and other appropriate 
applications. 


#077 


Representation and Processing of Spatially Structured Infor- 
mation in Cognition. University of Louisville, Systems Science 
Institute, Louisville, KY 40208. Bruce A. Whitehead. $77,840 
for 24 months beginning September 5, 1980. 


Knowleuge about space forms a basic framework 
for processing information about the world. The 
principles of representing and processing such spatial 
knowledge are the topic of the proposed research. 
The objectives are to better understand these prin- 
ciples, both within the specific constraints of human 
information processing and in the more general case 
in which some of these constraints are relaxed. The 
feasibility of such principles under well-defined 
constraints will be investigated and tested in a 
simulation model. Cognitive operators for spatially 
structured information will be modeled on the basis 
of both physiological and psychological data. The 
modeling technique will bring this human data to 
bear upon principles stated in the broader framework 
of information science. The generalizability of these 
representation and processing principles will then 
be explored. 


#078 


A Simultaneous Equation Model of the Demand for Academic 
and Professional Journals. University of Lowell, Department of 
Economics, One University Avenue, Lowell, MA 01854. 
Carol C. McDonough. $25,342 for 12 months beginning 
May 9, 1980. 


The objective of this proposal is to construct and 
estimate a simultaneous equation model of the 
demand for academic and professional journals. 


~~ 
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A four-equation model will be developed that #08 | 


yonnnw both — and ——e pournel A Generalized Model of Document Retrieval. University of 
SARS, JOUTRE CON, JOUTRE price, = journal Massachusetts, Department of Computer and Information 
product (measured as number of pages) in terms of Science, Amherst, MA 01003. W. Bruce Croft. $61,740 
parameters that are exogenous to the model. Such for 24 months beginning June 27, 1980. 
variables include: journal quality, size of the potential 
journal market, average annual journal costs, number The purpose of this research is to enhance both 
of journals published by the publisher, and the the understanding of information retrieval processes 
technical /scientific nature of a journal, and a and the capabilities for designing more effective 
journal's subsidy profit relationship with its retrieval systems. The project includes tasks to 
publisher. develop a probabilistic retrieval model and to 
evaluate performance of the model by applying 
several measures to two test collections of documents, 
#079 queries and relevance judgments. 
Research on Behavioral Variables Related to Information Specifically, the research will extend the proba- 
Systems. University of Maryland, College of Library and bilistic models based on binary index terrns (those 
Information Services, College Park, MD 20742. Jerry S. of Spark-Jones & Robertson and Van Rijsbergen) 
Kidd. $163,011 for 18 months beginning August 22, 1979. to include weighted index terms. Two conditional 
The project will utilize techniques borrowed from probabilities are introduced consisting first of the 


probability that a term occurs in a relevant docu- 
ment, and then the probability that the term occurs in 
a given document description. The retrieval results 
will be evaluated using two test collections and 
applying recall and precision tables of E values, 
and a measure relating total number of relevant 
documents retrieved to queries. 


cognitive psychology and artificial intelligence 
{scripts / frames) in order to identify a scientist's 
context and status of research at any particular point 
in a research project. The purpose is to be able to 
tailor information products to meet the specific 
requirements of the research process as they change 
during the various phases of a project. 

The major task will be to determine whether it 


will be possible to prepare realistic, detailed scripts #08? 

and frames. Experimental Evaluation of Information Delivered in \ arious 
The study is intended as the first step toward the Modes and Combination of Modes. Massachusetts Institute 

ultimate goal of developing an information system of Technology, Center for Advanced Engineering Study. 

which will interact with a scientist in diagnosing Cambridge, MA 02139. Karen C. Cohen. $131,746 for 24 

his her information needs, in constructing a schema months beginning September 26, 1979. 


which places these needs in the context of what he 
already knows, and in matching this knowledge 
structure (schema) with scientific information con- 
tained in the system data bases. 


The purpose of this study is to evaluate the 
effectiveness of various modes of information 
transfer. Modes to be assessed will include: lecture. 
reprints in print iorm, facsimile reprints accessed 
on-line, and audio cassettes of lectures. Effectiveness 


#080) will be defined and examined in terms of user 
Applications of Logic and Theorem Proving to Databases. characteristics, user criteria for selection of one or 
University of Maryland, Department of Computer Science, more modes, and actual transfer of knowledge. 

College Park, MD 20742. Jack Minker and Barry E. Jacobs. The study will access the various modes singly 
$207,835 for 36 months beginning April 18, 1980. and in combinations. The critical events will be 


lectures delivered at annual and national meetings 
of the American !nstitute of Chemical Engineers. 
The alternative modes will be reproductions of the 
contents of these lectures. 

Results should interest information science re- 
searchers concerned with understanding the effects 
of modes on transfer of information, and systems 
designers who wish to improve the delivery of factual 
information. 


The mayor thrust of this research ts the application 
of automated theorem proving and mathematical 
logic to database investigations. Three types will 
be treated: {1} theorem proving by linear resolution 
with unrestricted selection function using trees 
{LUST theorem proving}, (2) first-order logic and 
databases, and (3) database logic. Database logic 
isa recently developed formalism whereby existing 
database systems can be viewed in terms of logic. 
It has the prospect of providing a unifying theoretical 


framework for database systems. #083 
This project continues research supported under The Use of intonation in Automatic Speech Understanding. 
NSF Grant MCS873-03433 A02. Massachusetts Institute of Technology, Center for Cogni- 
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tive Science, Cambridge, MA 02139. Janet Pierrchumbert. 
$43,701 for 24 months beginning July 3, 1980. 


Intonation, as expressed in the fundamental 
frequency of the voice is related to the location of 
Stress in words and to syntactic phrasing. The 
proposed research is directed towards using the 
intonation system to help a computer system, for 
understanding speech, determine what words it has 
heard, and how these words are organized into 
sentences. It is anticipated that work in this area 
will contribute to making speech understanding 
systems good enough to be useful in many practical 
applications, and that the research will also add to 
the understanding of how people process speech. 
Pilot work suggests that people use a number of 
qualitatively different intonation patterns even when 
asked to speak neutrally. Because of this variety, 
the location of stressed syllables and phrase bound- 
aries cannot be identified from the fundamental 
frequency contour. The investigator proposes to 
design a computer program which would screen 
possible analyses of an utterance, reyecting an analysis 
if the observed fundamental fequency contour 
represented an impossible intonation for the stress 
and phrasing hypothesized. 


#OX4 


A Research Program on Electronic Document-Delivery Systems 
for Interlibrary Loans. Massachuseiis Institute of Technology, 
Electronic Systems Laboratory, 77 Massachusetts Avenue, 
Cambridge, MA 02139. J. Francis Reintjes. $83,725 for 12 
months beginning February 14, 1979. 


The project sets forth a two-year research pro- 
gram pertairing to electronic document delivery 
systems applicable to interl*brary loans. In the first 
year, a study will be made of technological alterna- 
tives for the various segments of an electronic 
document delivery network. including information 
encoding methods, data compression schemes, 
communications channels and receiving end facih- 
ties. Non-technological issues that impact upon the 
technological organization of the network will also 
be studied. In the second year, experimental evalu- 
ations will be made of selected basic approaches 
that show promise of contributing substantially to 
electronic interlibrary loan networks. An effort will 
also be made to establish an experimental two-mode 
setwork as a means of assessing the viability of the 
electronic approach from technological and non- 
technological viewpoints. 


#085 

Electronic Information Exchange in Mental Workload Re- 
search. Massachusetts Institute of Technology, Cambridge. 
MA 02139. Thomas B. Sheridan. $148,454 for 18 months 
beginning July 19, 1978. 


The problem of defining and measuring mental 
workload is a critical one in complex man-machine 
systems such as aircraft piloting and traffic control, 
nuclear reactors, and various military systems where 
there are large risks to public safety and large 
concentrations of capital under the control of one 
or few persons. In such systems perceptuz! and 
decision demands may change rapidly from a state 
of boredom when automation functions perfectly 
to a state of high stress and operator unreliability 
when automation fails. 

Thirty researchers and engineers concerned with 
mental workload in the USA, Canada, and Great 
Britain will employ the New Jersey Institute of 
Technology's facility for research on electronic 
information exchange to define terms and experi- 
mental questions; share knowledge of applicable 
theory and research: organize and critique research in 
progress; and draft, edit, review and publish archival 
papers. These papers are to be transferred to files 
publicly accessible by anyone through a commercial 
computer network, and the availability and use of 
this services is to be advertised throughout the 
broader community of interest. 


#086 
Investigations of Computer-Aided Document Search Strategies. 


Massachusetts Institute of Technology, Laboratory for 
Information and Decision Systems, Cambridge, MA 02139. 
Richard S. Marcus and J. Francis Reintyes. $155,360 for 
24 months beginning June 11, 1980. 


The purpose of this research is to simplify the 
tasks which confront users of online retrieval systems 
in their attempts to gain access to desired informa- 
tion. The difficulty of learning and using existing 
systems is demonstrated by the fact that those who 
ultimately need and use the information derived 
from the retrieval systems by and large cannot and 
do not operate the systems themselves. 

This project is aimed at investigating how the 
user's effectiveness and efficiency of searching can 
be improved by providing him with computerized 
assistance in the formulation and execution of his 
search strategy. More specifically it will develop 
and test procedures and dynamic aids that will help to 
optimize search strategy formulation in conjunction 
with initiatives taken by the user. These techniques 
derive from an emerging theory of online search 
opumization which calls for a combination of factors 
including a knowledge base, computational lin- 
guistics, and computer-assisted instruction techniques 
to automatically select appropriate search strategies 
and data bases. 


#087 


Investigations of Electronic Interlibrary Resource-Sharing 
Networks. Massachusetts Institute of Technology, Laboratory 
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for Information and Decision Systems, 77 Massachusetts 
Avenue, Cambridge, MA 02139. J. Francis Reintjes. $95,000 
for 12 months beginning June 11, 1980. 


This is a continuation of NSF Grant IST78-23251 
entitled ““A Research Program on Electronic 
Document-Delivery Systems for Interlibrary Loans”. 


#088 


Natural Computation and Control. Massachusetts Institute 
of Technology, Cambridge, MA 02139. Whitman A. Richards. 
$234,196 for 12 months Uxcginning April 11, 1980. 


Research in natural computation explores the 
information processing mechanisms utilized by 
biological systems in performing basic sensory and 
motor functions. The MIT effort is primarily con- 
cerned with understanding the computational mech- 
anisms which underlie the human visual system's 
representations of the external world. Topics to be 
investigated will include: (1) stereopsis, (2) texture, 
(3) structure from motion, (4) transparency, (5) color 
perception and normalization, and (6) derivation 
of object shapes from contour and shading. 

This project continues research supported by NSF 
Grant MCS77-07569. 


#OR9 


Computer-Based System for Sharing Technical Information. 
University of Michigan, Center for Research on Learning 
and Teaching, Ann Arbor, MI 48109. Karl L. Zinn. $20,000 
for 12 months beginning May 5, 1978. 


This project is an extension of a research grant 
testing the feasibility of using teleconferencing and 
electronic-publishing techniques to facilitate the 
sharing of scientific and technical information among 
individuals in a scientific community. The infor- 
mation to be shared is stored in a generally accessible 
computer system. A computer communication 
network allows one who originates information to 
disseminate it to an audience of potential users 
without the delay, expense or editorial constraint 
of conventional publication. The continuation will 
carry the project activity further into the communities 
involved, and examine the effects in greater depth 
through interviews. Most participants are from 
universities, others are from government and 
industry. 


#090 

Computer Studies in Formal Linguistics. University of Mich- 
igan, Department of Computer and Communication Sciences, 
2076 Frieze Building, Ann Arbor, MI 48109. Joyce Friedman. 
$87,508 for 24 months beginning June 28, 1979. 


The purpose of the proposed project is to study 
logic-based formal systems for natural language, 


#09 


Links of Interest and Expertise Among Scientists. University 
of Michigan, Ann Arbor, MI 48109. Manfred Kochen. 


$49 


#09 


Dynamics in the Organization of Scientific Specialties. 
versity of Michigan, Mental Health Research Institute, 
Ann Arbor, MI 48109. Manfred Kochen. $134,782 for 24 


Uni 


using the computer as a research tool. Formal 
linguistic theories can explicate and clarify the 
relationships among syntax, semantics, and prag- 
matics; computer implementation imposes a proc- 
essing view which adds a new dimension to the 
theory. Montague grammar provides a well-defined 
fragment in which important questions can be 
phrased precisely and answers can be formulated 
and tested. 

The main goals of the project are (1) to use 
mathematical and computational techniques to 
explore the syntactic and logic components of 
Montague grammar, (2) to develop further and more 
formally the notions of dynamic int<7pretations, 
and (3) to investigate their application to problems 
in pragmatics. 

The results of the project will be LISP programs 
and formal papers. These will be usable by other 
linguists in doing research and in teaching formal 
grammar. 


.621 for 12 months beginning August 14, 1978. 


Connections among scientists based on interests 
and expertise are vital for the scientific enterprise 
in the selection of reviewers and in building repu- 
tations through recommendations. Scientists use the 
connection network to amplify their individual 
capacities to cope with the volume and difficulty 
of accumulating scientific knowledge. Understanding 
the network of connections is necessary for providing 
information systems with a scientific foundation 
and for clarifying the issues regarding the healthy 
growth of scientific knowledge. The researchers will 
examine the functioning of coupling networks and 
their impact on organizing specialty clusters of 
scientists; develop and test new methods for 
observing how specialty clusters are formed: and 
develop a method of describing interests and com- 
petence to contribute and to evaluate research. 


’ 


months beginning August 7, 1980. 


The processes whereby scientists add to knowledge 
in their specialties affect the structure of scientific 
information. Understanding how specialties grow 
helps in designing information processing systems 
for scientists. Communication among spe-ialists and 
the organization of knowledge play a key role. 

Four tasks will be undertaken to explicate the 
growth of specialties: (1) search for a representation 


of a specialty state; (2) derivation of difference 
equations for how the state-variables change: 
{3} analysis of these nonlinear difference equations; 
(4) evaluation and extension of this model to deal 
with many specialties simultaneously and to char- 
acterize knowledge spaces and the invariance of laws 
expressing knowledge dynamics under transforma- 
tion from one representation to another. 

These tasks will be accomplished primarily by 
mathematical analysis, construction and use of 
computer programs to generate and compare rep- 
resentations and sensitivity analysis of the difference 
equations. Some empirical studies, using samples 
of mathematical literature to trace the use of 
theorems, methods, conjectures and concepts in the 
generation of new knowledge units will be under- 
taken to justify the dynamic model and to compare 
the implied direction of change in the state-variables 
with past trends. 


will implement the most promising evaluation designs 
produced. 


#095 


International Scientific and Technical Information Programs. 
National Academy of Sciences, 2102 Constitution Avenue, 
NW, Washington, DC 20418. Judith A. Werdel. $96,500 
for 12 months beginning March 6, 1979. 


This award provides support for continuation of 
U.S. participation in the International Federation 
for Documentation (FID) and the International 
Council of Scientific Unions/ Abstracting Board 
(ICSU / AB), as well as for continuation of certain 
information clearinghouse services within the Na- 
tional Academy of Sciences-National Research 
Council [NAS-NRC) that relate to international 
scientific and technical (S&T) information. Support 
for 1979 U.S. dues to both international organiza- 
tions is included in the award, as is support for the 


#093 


A Conference on Serial Collections in a Network Environment. 
Minnesota Higher Education Coordinating Board, St. Paul, 
MN 55101. Alice E. Wilcox. $32,469 for 12 months begin- 
ning September 29, 1978. 


This conference will bring together 6 leading 
researchers and 30 academic librarians to suggest 
guidelines for the development and managment of 
serials collections in academic libranes participating 
in resource-sharing networks. The attendees will 
examine and evaluate the results of a number of 
research efforts performed by the 6 researchers and 
will synthesize these results into practical guidelines 
for use by academic librarians in their planning and 
management functions. 


U.S. National Committee for FID and ICSU / AB. 
FID and ICSU AB are two of the principal inter- 
national organizations concerned with S&T infor- 
mation and information technology. Through them. 
the U.S. scientific, technical and information com- 
munities Cooperate internationally in many programs 
directed toward development of S&T documentation 
and the organization exchange, and dissemination 
of in ormation for S&T abstracting and indexing 
Services. 

Clearinghouse and information services within 
the NAS-NRC that are supported by this award 
constitute an important U.S. resource for the col- 
lection and referral of information on international, 
bilateral, and foreign national S&T information 
organizations, programs, and iv< ~s. 


#094 #096 

Systematic Process Evaluation of Fundamental Information Refinement and | se of Concept Similarities in Information 
Science Research. Mitre Corporation, Metrek Division, 1820 Retrieval. National Bureau of Standards, \nstitute for 
Dolley Madison Boulevard, McLean, VA 22101. Karen Computer Sciences and Technology, Building 225, Wash- 
Levitan. $29.847 for 6 months beginning September 26, 1978. ington, DC 20234. Matthew B. Koll. $35,498 for 12 months 


beginning September |, 1980. 


This six-month award provides for the develop- 
ment of a design to carry out an evaluation of the 
information science program. A systems approach 
to evaluation ts taken in the study. It will attempt 
to identify and describe the set of interrelated 
variables, which influence program outcomes. Its 
products will include: mapping of the field of 
information science; a design for examining non- 
NSF sources of funding for information science 
research: an initial description of the current state 
of knowledge in information science as well as 
related fields, and the characteristics of proposed 
information science research. 

Several design studies, in addition to this one, 
are being conducted concurrently. Phase Il awards 


It 1s well established in information retrieval theory 
that the ability of a system to retrieve relevant 
documents is dependent on the agreement between 
the system's and users’ concepts. In designing IR 
systems, this is often stated as a vector space model of 
concept similarities. This model 1s the focus of the 
proposed research. The aim of this study is to 
maximize the agreement between the concept simi- 
larities derived from text, and those perceived 
similarities held in the population of data base users. 
The project will also examine methods by which 
IR systems can use the enhanced concept relations 
The final output will be the design of an IR system 
which utilizes such enhancements. 


The methodology involves human users as well 
aS experimental machine runs. Users’ views about 
the similarity of concept pairs will be obtained from 
questionnaires. System's views will be obtained from 
term co-occurrence data, weightings, and word 
distance measures. By measuring the closeness 
between these two v'*.. . predictions will be made 
regarding the usec © statistical techniques as a 
means to s mulate existing semantic relations. 
The research will test varic .s algorithms for creating 
the concept space from term co-occurrence data in 
the text of a data base. 


operated by the New Jersey Institute of Technology 
under contract with the Foundation. Users of this 
facility can exchange messages, participate in con- 
ferences, and collaborate in the preparation of 
documents even though they are dispersed through- 
out the country and utilize the system on different 
schedules. The Foundation has already made the 
facility's resources available to twenty users to be 
named by the Commission. This award allows the 
Commission to designate eighteen additional users 
and to acquire the further resources needed for 
effective employment of the facility by all thirty- 
eight of its users. 


#097 


Support of the Numerical Data Advisory Board (NDAB). = #99 

National Bureau of Standards, Office of Standard Reference Study of Impacts of a Formal Computer-Based Information 
Data, Room A323, Physics Building, Washington, DC 20234. System on Information Networks. Naval Ocean Systems 
James E. Loy. $12,300 for !2 months beginning August Center, System Develozment Branch, San Diego, CA 92152. 
Ww, 1979. Benjamin A. Saltzer. $71,891 for 18 months beginning 


The Numerical Data Advisory Board (NDAB) May 23, 1978. 
of the National Academy of Sciences (NAS) provides The project is to determine and quantify the 
a national forum for the consideration of the whole impacts of formal computer-based information 
range of problems associated with the compilation sy‘cems on the flow of scientific and technical 
and dissemination of scientific data. Through the information within and among engineering groups. 
establishment of working groups of carefully selected These may include changes in the quantity and 
experts the NDAB carries out detailed studies and quality of the flow, the key roles, and changes in 
“ submits reports relating to the generation, processing. the informal intergroup information network. A 
storage, retrieval and use of critically evaluated data. balanced mix of quantitative and qualitative tech- 
Among the efforts planned for the coming year are niques is to be used to gather and analyze data 
two of special interest to the Foundation: “encour- collected “before.” “during.” and “after” the intro- 
aging the development and application of new duction of computerized information systems to the 
methods of data management.” and “identifying groups. Participants in the study are groups of 
specific areas of data management and information engineers and scientists working in the area of sonar 
science in need of further research and development.” and undersea acoustic systems. 
The National Bureau of Standards 1s the executive 
agent for Federal agencies contributing to support #100 
of the NDAB program. Human Attention and Performance Limitations in Processing 
Visual Information. l niversity of Nebraska, 208 Burnett Hall, 
wns Lincoln, NE 68888. John H. Flowers. $34,672 for 12 months 
Electronic Information Exchange in Planning for White House beginning July 30, 1979. 
Conference on Library and Information Services. National 
Commission on Libraries and Information Science, \7\7 K 
Street, NW, Washington, DC 20036. Marilyn Gell. $40,335 
for 7 months beginning August 27, 1979. 


The research is concerned with the ability of 
human observers to allocate attention to spatial 
regions of a vistal display, and their ability to ignore 
irrelevant visual information which appears in the 


In November 1979, the National Commission on 
Libranes and Information Science will hold the first 
White House Conference on Library and Informa- 
tion Services. The conference will be attended by 
more than 1100 delegates, drawn from the infor- 
mation professions and the general public. Its goal 
is to examine a variety of ways in which libraries 
and information centers can more effectively bring 
information to the people of this country. To 
facilitate planning for the Conference, the Com- 
mission will make use of the Electronic Information 
Exchange System, a research facility developed and 


vicinity of their attentional “focus.” One series of 
experiments will examine how structural properties 
and famiharity presented in one portion of a brief 
visual display affect accuracy for recalling or rec- 
ognizing material appearing in other regions of the 
display (e.g.. attention “capture™). A second sequence 
of experiments will study interference effects caused 
by distractors which appear in the spatial vicinity 
of a fixated “target.” and will examine how the 
focus of the attentional “beam” can be influenced 
by a number of task variables which change the 
state of the observer. The data obtained will be 


used in the evaluation and development of models 
of attention which will provide clues for the design 
of aruficial systems capable of selective abstracting of 
input information. 


#101 


Electronic Information Exchange Test Facility. Vew Jersey 
Institute of Technology, 323 High Street, Newark, NJ 
07162. Murray Turoff. $275,197 for 18 months beginning 
August 3, 1978. 


The Electronic Information Exchange System 
(EIES) ts operated as a facility for experimental 
research on the use of computer-based communi- 
cation and information resources by geographically 
dispersed individuals. Based on a dedicated mini- 
computer with more than 200 milhon characters of 
disc storage available for direct support of users, 
the EIES offers such features as private message 
exchange, computer-based conferences, on-line 
notebooks, and an all-electronic journal. By means of 
TELENET, a comevercial packet-switching tele- 
communications system, the full range of these 
capabilities can be made available to 300 participants 
in NSF research projects. 


#102 


Electronic Information Exchange Test Facility. New Jersey 
Institute of Technology, 323 High Street, Newark, NJ 
07102. Murray Turoff. $2,000 for 0 months beginning 
March 28, 1980. 


Supplement to IST-C77-21008 


#108 


Probabilistic Measurement Theoretic Models of Information 
Processing. New York University, Washington Square. 
New York, NY 10003. Jean-Claude Falmagne. $156,909 
foi 31 months beginning April 2, 1980. 


The goal of this project ts to further mathematical 
understanding of a broad class of mechanisms that 
appear to arise in human information processing. 
The investigator will study probabilistic models 
intended to describe the transducuon and recoding 
of mput information into random vari.bles, and 
how choies and decrsions are made by comparison 
of the relevant random variables 


#\04 


An In-Depth Analysis of a Comprehensive Data Base on the 
Communication of Scientific ‘Scholariy Knowledge. New ) ork 
University, Department of Economics, $38 Tisch Hall, 
Washington Square, New York, NY 10003. Fritz Machlup 
$160,178 for 12 months beginning July 10, 1978 


This project 1s for additional and in-depth 
analyses of the data base developed by the principal 
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investigator in the course of earlier research on the 
production and distribution of scientific and tech- 
nological information. Dzta regarding the publi- 
cation of books and journals will be analyzed in 
greater detail than previously possible in such areas as 
book sales, by field, over a period of 8-10 years, 
and sales data, by type of purchaser; costs of journal 
distribution, and ratios of operating costs to total 
costs. Subscription prices and circulation data will 
also be examuned in order to estimate price .‘asticities 
of demand hy type of subscribers. Additional anal- 
yses of library data will also be performed. 


#105 


Information Science: An Analysis of Methodological Issues 
and Interdisciplinary Relationships. New York University. 
Department of Economics, Washington Square, New York, 
NY 10003. Fritz Machlup. $39,973 for 12 months beginning 
May 27, 1980. 


The purpose of th’s project is to analyze the 
various methodological issues and interdisciplinary 
relationships which arise from the fact that infor- 
mation science is not yet a fully integrated assem- 
blage of systematic studies. It consists of various 
approaches addressed to investigations of the opera- 
tions, devices, technology and organization, by 
which humans transmit and receive, and thereby 
transfer knowledge among one another. It is nec- 
essary to investigate the ways in which these various 
approaches interact in order to form a picture of 
the relationships involved. 

Specialists in the following disciplines will be 
invited to prepare papers on the intra and inter- 
disciplinary relationships as they see them: Library 
Science; Computer Science; Human Communica- 
tions including Cognitive Psychology: Systems 
Theory; Communication Theory; Living Systems: 
Artificial Intelligence, Cybernetics; and Decision 
Theory. The Principal Investigator will analyze the 
logical and methodological processes in each of these 
approaches and synthesize a coherent description 
of the present status of information science. 


#106 


Publication of Proceedings of a Conference on Telecom- 
munications and Productivity. New York University, Center 
for Scrence & Technology Policy, New York, NY 10003. 
Mitchell L. Moss. $9,976 for 12 months beginning April 
8. 1980. 


The Center for Science and Technology Policy 
at New York University held a two-day conference 
to explore the social and economic consequences 
of new telecommunications systems. 

This conference identified emerging uses and 
trends and examined organiztional factors affecting 
the application of telecommunications systems. The 


major issues discussed are: {1} Present and future uses 
of telecommunications systems in the home and 
office; (2) The potential impact of communications 
technologies on productivity in public and private 
organizations; and (3) Alternative policies for future 
research and development in telecommunications. 

The proceedings of the conference will include: 
conference papers, summaries of the workshop #109 


sessions, and an interpretative summary of the Computer-Based Investigation into the Structure of Informati 

complete conference. The proceedings will be pub- New York University, Linguistics Department, New York 

lished by the Addison-Wesley Publishing Company. NY 10003 Naomi Sager. $274,645 for 24 months beginning 
January 25, 1980 


The proposed research will utilize information 
formats as the means for comparing and quantifying 
structural properties of information in different 
subject areas and on different levels of generality 
of information. An information format is a row- 
column matrix derived by formal and computerized 
procedures to sample texts in a given area such as 
a subfield of science into which successive sentences 
of texts in the field can be mapped. These information 
formats then contain the same information as the 
original texts. 

It should be possible to obtain precise descriptions 
of three levels of information: (1) ind: vidual facts 
{as in reports); (2) factual summaries and general- 
ization (as in survey articles): and (3) fact correlations 
and analysis (as in research articles}. A hypothesis 
to be tested is that relations between the structures 
on the different levels can be established, showing 
how particular factual information is transformed 
to become the object of generalization and the 
operand of theoretical constructs. 


language types. A prototype query language appro- 
priate to natural-language information systems will 
be designed and evaluated. The results of the research 
should provide a clear picture of the technical 
requirements for extending data base technology 
for use with natural language data bases in the future. 


#107 


Research into Methods for Automatic Classification and Fact 
Retrieval in Science Subfields. New York University, De- 


partment of Linguistics, New York, NY 10003. Naomi Sager. 
211,000 for 24 months beginning December 29, 1977. 


The purpose of this research is to expand the 
fundamental understandings which are essential to 
the eventual design of information systems which 
can analyze, process, combine and deliver desired 
information from natural language data bases. Such a 
capability would supplement existing document 
retrieval systems by operating on the natural lan- 
guage texts of the documents in order to determine 
whether desired factual relations are present in the 
data base. In some cases, this type of capability 
could even compare and compile data from different 
source documents by way of treating the natural 
language texts of the documents. 

The first objective is to develop procedures and 
computer programs for obtaining the informational 
word classes of a science subfield and for classifying 
textwords according to these categories. The second 
objective is to extend our knowledge of information #110 
Structures in science anj to the compatibility of Computer Applications to Analytical Chemistry. University 
such structures with texts in the publishing literature. of North Carolina, Chapel Hill, NC 27514. Thomas L. 
Thus, the research will develop automatic methods Isenhour. $71,000 for 12 months beginning May 15, 1978. 
pd -aggene Denagy raareee ser. genatnegueqerse Dr. Isenhour is studying further and developing 


and for tagging the individual words with the the follow: oo aaaitea os denaiiatien! 
appropriate class markers. The development of such veh “a Se ee ae 
chemistry. 


: ssential if adv: language-based, j , 
procedures are essential if advanced language A. Rank and factor analysis methods are being 


as Opposed to document-based information proc- w 
essing is to be applied in diverse areas of science. Coveloped for mer waplpeeten Chere ne me wguenel 
of components in a mixture and isolating pure 


#108 component spectra for file searching. Mixtures of 


Xylenes and saturated hydrocarbons are being used 
Data Base Management Systems for Natural Language Data ne ncergart ye en patron aren 
Bases. New York University, |S Washington Place, New York, 


ener further tested on complex pesticide mixtures. 
NY 10003. Naomi Sager. ‘$188,943 for 24 months beginning B Inf d 
' : spectro: led with g: 
September 7, 1978. nfrared spectroscopy coupled with gas 


chromatography is being developed as a method 


This project will explore the application of current 
data-base management systems (DBMS) to the 
management of natural language data bases. Natural 
language information structures will be compared 
with the information structures handled by existing 
DBMS of the hierarchical, network and relational 


of analysis of complex samples. Data collected in 
the Fourier domain are used to reconstruct the gas 
chromatogram much more efficiently than in the 
standard approach of transforming each individual 
interferogram. Further study is in progress, coupled 
with Part A, to determine background information, 


isolate pure spectra and develop automated pro- 
cedures to analyze complex mixtures. 

C. As data bases have become larger and better, 
a great deal of attention has been given to developing 
search systems. There is, at present, no standard 
method of comparing the capability of search 
systems. In this study an error propagation method of 
evaluating the performance of library search simi- 
larity measures is being developed and applied to 
various search systems of mass spectral data. The 
technique is based on repeated searches several 
nearest matches to selected compounds. Performance 
is being judged by the rate at which the dissimilarity 
between target compounds and propagated matches 
increases. 

This research is a continuation of that currently 
supported by Foundation Grant CHE77-09762. 


#11’ 

Computer Applications to Analytical Chemistry. University 
of North Carolina, Chapel Hill, NC 27514. Thomas L. 
Isenhour. $66,500 for 12 months beginning June 11, 1980. 


This research is a continuation of work currently 
supported by NSF and described above. 


#112 


Development of a Conceptual Framework to Guide Research 
in Information Science. NTL Institute for Applied Behavioral 
Science, 1501 Wilson Boulevard, Arlington, VA 22209. Edwin 
E. Olson and Dale Lake. $49,611 for 12 months beginning 
September 7, 1978. 


This project aims to identify promising models 
and paradigms in the social and behavioral sciences: 
evaluate their application to information science 
problems: and encourage the information science 
community to use those methods, assessed to be 
useful, in their research. 

A three-step process will carry out these objectives. 
First, a select group of accomplished social and 
behavioral scientists will be convened to develop 
an inventory of models with the potential to guide 
information science research. This will be followed 
by a symposium in which a iarger group of scientists 
will present and discuss technical papers to critique 
and expand the output of the first session. 

Results of the symposium will be made widely 
available to the information science research com- 
munity, as well as to behavioral and social scientists. 


#113 


Message Spaces and Computing Time in Performance Allo- 
cation Processes. Northwestern University, Center for Math- 
ematical Studies in Economics and Management, 633 Clark 
Street, Evanston, IL 60201. Kenneth R. Mount. $78,409 
for 24 months beginning August 7, 1980. 
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Information plays both an implicit and explicit 
role in the allocation of economic products and 
factors of production. Firms and governments are 
large complex entities explicitly processing infor- 
mation about current and future technology, re- 
sources, demography, etc. Despite this imporiant 
role in economic matters, information aspects nave 
been studied largely as special topics, rather than 
as an integral component of the allocation process. 

An investigation of informational and computa- 
tional requirements for economic allocation mech- 
anisms is proposed. Particular attention will be given 
to incorporating into the model the information 
processing costs internal to a particular economic 
unit. Internal costs are important because if they 
are ignored, then the merging of two units appears 
to decrease the information requirements of the 
system, whereas in fact merger merely internalizes 
communication between the merged units. Tradeoffs 
between the amount of information contained in 
economic systems and complexity of computation 
will be examined. 


#114 


Research on the Impact of a Computerized Circulation System 
on the Performance of a Large College Library. Oberlin 
College, Oberlin, OH 44074. Virginia B. Harris. $139,980 
for 24 months beginning September 21, 1978. 


The project is an in-depth study of the impact of 
an automated circulation system on the performance 
of a large college library. It will examine the rela- 
tionships among three types of variables: (1) user 
attitudes, (2) system states [i.e., the operating status of 
the system}, and (3) objective performance [i.e., the 
quality and quantity of the services re’ dered). Data 
collection will be integrated into the automated 
circulation system. Analysis and use of data will be 
integrated into regular library operations and 
management. One of the goals of the project is to 
develop and refine objective performance meas- 
urement mechanisms for library operations. 


#115 


The Probabilistic Matching and Control of Author Names in 
Automated Library Systems. OCLC, Incorporated, |1\25 
Kinnear Road, Columbus, OH 43212. Thomas B. Hickey. 
$42,321 for 12 months beginning August 21, 1979. 


The process of author name matching is one of 
the most difficult and complex problems which 
confronts computerized bibliographic data bases 
such as library catalogs. The difficulties arise from 
such factors as: size, diversity of different author 
names, a high proportion of foreign names which 
have been romanized, and pseudonyms and other 
name variants. When names cannot be effectively 
matched against search queries the result can be 


“lost records. In time these problems can distort 
the accuracy of a data base. The purpose of this 
research is to increase the role of computers in 
name-matching. This will make it possible for 
computers to create and monitor authority files with 
less human effort and greater data base integrity 
and searching ability. 

The techniques and algorithms which will be 
investigated will enable a computerized system to 
automaticaily: correct and/or flag typographical 
errors in names; identify names in a data base that 
are similar to a name entered by a search query; 
measure the similarity among names by computing 
the probability that two or more names refer to the 
same person. These techniques will generalize to a 
wide variety of problems ranging from those en- 
countered by libraries to those which are peculiar 
to abstracting and indexing services. It is expected, 
in particular, that the resultant techniques will form 
the basis for a new approach to authority control 
within automated library systems. 


#116 


Computer Terminal Requirements for On-line Catalogs in 
Libraries. OCLC, Inc., 1125 Kinnear Road, Columbus, OH 
43212. Neal K. Kaske. $97,902 for 12 months beginning 
May 22, 1980. 


This proposal addresses problems related to the 
role of the library as both an agent of information 
transfer and a user of information technology. 
Specific project objectives include: develop measures 
of library use with particular emphasis on card 
catalog and online catalog use; identify patterns of 
human interactions with the two catalogs; develop 
and test an algorithm for estimating the number of 
public computer terminals needed by a library to 
support an online public catalog. Library use data 
will be collected at the Ohio State University 
Libraries which have both online and manual catalog 
systems. 

This research will produce guidelines that libraries 
can use to estimate the number of computer terminals 
needed to support an online public catalog. These 
guidelines will be independent of library type and 
size, and will benefit individual libraries, systems 
of libraries, and library networks in their economic 
and systems planning. 


on previous research which has been underway at 
Ohio State University over the past several years. 
The approach relates information to its effectiveness 
and thus its use and value. Theory, measures, and 
relationships thus derived relate to the way infor- 
mation, once it is transmitted and received, is utilized 
by a decision-maker. 

More specifically the research attempts to: (1) iden- 
tify and quantify important variables and parameters 
in the information flow process; (2) establish rela- 
tionships among these variables; (3) apply the theory 
to practical situations and to examine the resulting 
implications; and (4) develop models both simulation 
and experimental, to utilize and validate the theory. 


#118 


Concurrency in the Access and Manipulation of Information 
Structures. University of Cregon, Eugene, OR 97403. Carla 
Schlatter Ellis. $62,500 for 24 months beginning June 18, 
1980. 


One of the objectives of data base design is to 
collect the information needed for various related 
applications and organize it into a unified system 
that allows efficient access and modification. With 
many users sharing the data structures of a large 
data base, purely sequential operations may provide 
less than satisfactory performance. 

Concurrent operations, in contrast, potentially 
improve performance but introduce new problems 
(e.g., indefinite postponement) and aggravate old 
ones (e.g., consistency). Thus an important area of 
investigation deals with techniques for increasing 
concurrency of operations (primarily search, inser- 
tion, and deletion) on the kinds of data structures 
employed in data base systems. 

In sequential programming “good” solutions are 
often based on recognized techniques or Gata base 
Structures. For parallel processing, more work is 
needed to build up a repertoire of techniques and 
data structures, to evaluate their performance, and 
to develop guidelines for their use. This background 
motivates the following proposed lines of research: 
{1} to identify and add to the set of techniques for 
concurrent manipulation of structured data, (2) to 
characterize the correctness issues associated with 
the application of these techniques, (3) to suggest a 
set of relevant performance measures, and (4) to 
provide a simulation tool for comparing solutions. 


#117 


Extension and Applications of a Theory of Information Flow #119 

and Analysis: The Use of Information in a Decision-Making Electronic Information Exchange in Legislative Technical 
Environment. Ohio State University, Department of Computer Research. Participation Systems, Inc., 43 Myrtle Terrace, 
Science, 1314 Kinnear Road, Columbus, OH 43210. Marshall Winchester, MA 01890. Chandler Harrison Stevens. 
C. Yovits. $105,670 for 24 months beginning July 16, 1979. $150,575 for 18 months beginning July 24, 1978. 


The purpose of this research is to develop a basic This project is a trial of electronic exchange of 
theory of information flow and analysis. It builds information within the community of legislative 
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technical researchers. This community took form 
in the early 1970’s due to three intersecting tend- 
encies: {1} development of state legislative staffs in 
general, (2) development of interests among scientists 
and engineers in public policy, and (3) development 
of various non-electronic networking techniques 
particularly suited to sharing technical information 
about legislative issues. 

Electronic information exchange will be used in 
an interstate exchange of both substantive and 
process information. Initial emphasis will be given 
to exchanging technical inquiries from legislators 
and the responses obtained from scientists and 
engineers. In these inquiries formatted response briefs 
will be used. The format will be revised on-line during 
the course of the project, resulting in a nationally 
acceptable convention for the preparation of response 
briefs. 

The body of data created during the project will 
be organized using the electronic medium. This new 
body of literature will be a valuable asset to legislative 
researchers which should continue to grow once 
the project ends. 


#120 


Correlation of Language Structure with Information. University 
of Pennsylvania, Philadelphia, PA 19104. Zellig S. Harris. 
$163,329 for 24 months beginning April 10, 1978. 


The purpose of this research is to determine the 
structure of language-borne information. It will first 
attempt to establish classes of words which have 
special information characteristics. It will then 
assemble these words into combinations which 
constitute assertive sentences. The work will be based 
on recent results concerning the structure of infor- 
mation. These results have shown that each sentence 
of a language is derived, in a regular and computable 
way, from a series of more simple statements. It 
has also been shown that these simple statements 
contain all of the information which is contained 
in the original sentence. Thus, the information 
content of any statement can be derived from the 
information content of the underlying structures 
of which the more complex statement can be reduced. 
The key point is, however, that the derivation of 
these simple statements do not depend upon human 
judgment but can be accomplished by a computer 
program for grammatical decomposition. 

The fact that just a few types of successions and 
combinations of these simple statements suffice for all 
language-borne information will enable this research 
to provide insights essential to a theory of infor- 
mation. From the standpoint of applications, the 
fact that the relationship of information to syntax 
has been established means that it will be possible 
for a computer to operate on informational content 
as opposed to merely word processing. In other 
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words, the development of a systematization of 
factual information on the basis of a computable 
formulation of language structure should be of major 
service toward computer processing of the actual 
information contained in factual, and especially in 
scientific, material. 


#121 


Alternative Representations of Information in Microeconomic 
Systems. University of Pennsylvania, Department of Eco- 
nomics, 409 Franklin Building, 3451 Walnut Street, Phila- 
delphia, PA 19104. Beth Elaine Allen. $77,987 for 24 months 
beginning July 30, 1979. 


To date, little research has been undertaken that 
defines information in general terms for economic 
theory. This project uses mathematical economics 
to analyze some aspects of the role of information in 
microeconomic theory and examines the relation- 
ships between information and microeconomics. 

The project is divided into three phases. The 
definition of information as a forecast will be 
compared to other ways to model information. The 
implications of the forecast approach for the notions 
of similarity of information, more information, and 
dispersed (or more dispersed) information will be 
evaluated. The second phase consists of an analysis of 
a microeconomic model of general equilibrium in 
which information consists of forecasts and focuses 
on the sense in which such equilibria can be rational 
when forecasts are hypothesized to be dispersed. 
And the third phase will include research on the 
extension of informational economic equilibrium 
results to random references in economics and other 
game-theoretic solution concepts which are relevant 
to economic theory. 


#122 

Computer Science and Computer Engineering Research Equip- 
ment. University of Pennsylvania, Department of Computer 
and Information Science, Philadelphia, PA 19104. Aravind K. 
Joshi. $175,100 for 12 months beginning July 2, 1979. 


NSF is providing support to the University of 
Pennsylvania for the purchase of a VAX 11/780 
computer system. This system will provide the 
computing power necessary to conduct several 
specific research projects in: (1) <utural language 
processing, (2) applicative languages, (3) distributed 
data bases. The equipment is not intended for general 
purpose computation, but will be a dedicated facility 
devoted to specific resez zh projects. 


#123 


Research in Natural Language Man-Machine Interaction. 
University of Pennsylvania, Department of Computer and 
Information Science, 3451 Walnut Street, Philadelphia, PA 


19104. Bonnie L. Webber. $83,598 for 24 months beginning areas. The first goal is to advance the study of 
August |, 1979. embedded join dependencies by using Boolean 
matrices. Specifically, the following problem will 
be addressed: given a set of dependencies, find an 
algorithm to determine additionally implied depend- 
encies. A second goal is to determine when a relation 
can be represented by another relation satisfying 
data depender.cies known to exist in the database. 
Finally, it is proposed to consider the consequences of 
using these substitute representations and to devise 
algorithms for applying them to a database. 


An important research question in regard to 
effective information transfer in co-operative natural 
language man-machine interactions is how a system 
should respond when it is unable to deal with a 
user's request. A maximally helpful response would 
be one which would not frustrate the user or try to 
outguess him, but rather would guide the user to a 
relevant answerable question. Such a response would 
require that a system (1) understand why the user's 
request is unanswerable, (2) recognize what it would 


need for an answerable request or what answerable Hts 

requests are in some useful way related to the given Training Modules for Use of Scientific and Technical Infor- 

one, and (3) communicate to the user both the mation Services—Phase II. University of Pittsburgh, Graduate 

problem and its allowable remedies in a way that is School of Library and Information Sciences, Pittsburgh, 

both effective and efficient. PA 15260. Elaine Caruso. $67,875 for 12 months beginning 

This project will investigate these requirements February 23, 1978. 

in the context of linguistic notions of repair and Currently, because of the non-standardization 

computational notions of meta-knowledge. “Repair” existing among computer-stored data bases and the 

is the name given by socialogists and sociolinguists retrieval systems providing access to them, placing 

to the set of processes involved in correcting mis- users of these systems dirctly on-line, without 

understandings in discourse which may arise as a extensive training, is impractical. Most user training 

function of either the speaker or the listener. In aids now available are system-specific and directed 

this case “meta-knowledge” refers to a system's towards information intermediaries, such as librar- 

knowledge of itself as a model of a particular domain ians, rather than to scientists and engineers or others 

and would include the system’s knowledge about with technical information needs. 

the extent of its coverage (or lack of coverage) of This research, initiated in FY 1976, under SIS76- 

that domain. 09538, will continue the design, development, and 

evaluation of training materials suitable for a wide 

#124 range of information users. They provide step-by- 
Completions of Relations and Embedded Join Dependencies Step on-line instruction aimed at upgrading users’ 


ability to identify data bases suitable for their needs 


in Relational Databases: A Study Using Boolean Matrices. : 
and access them through appropriate search systems. 


Pennsylvania State University, Department of Computer , ' . ; 
Science, 207 Old Main, University Park, PA 16802. Karen Computer-assisted learning sequences and simulated 


H. Chase. $36,630 for 12 months beginning June 26, 1980. versions will concentrate on evaluation of the 
learning and practice modules developed and in- 


Database management systems are being built surance of their transportability to other computer 
around increasingly large and complex databases. environments. Results will be reported in the lit- 
To provide flexible and effective responses to the erature and the training packages developed will 
storage, update, and retrieval demands placed upon be made available to the information and education 
them, these systems must be designed with a well- community. 
organized database model. Such models are provided 
by the relational approach to database design, a #126 
powerful method of representing data at the logical 


A Survey of Manpower Requirements for Scientific and 
Technical Communication. University of itsburgh, Inter- 
disciplinary Department of Information Science, Pittsburgh, 
PA 15260. Anthony Debons. $101,417 for 7 months begin- 
ning June 26, 1978. 


level. 

In the relational approach, a key role is played 
by various semantic constraints which induce so- 
called data dependencies (e.g., a given plane on a 
given flight has a unique first pilot). Such depend- 


encies make it desirable to replace complex tables The purpose of this research is to determine which 
by smaller tables which comprise independent units communication functions are performed in all sectors 
of information and are especially practical for of the economy, and the number involved in each 
updates. The study of dependencies is an area of function. Two major tasks are described: (1) to 
great theoretical and practical interest. develop a taxonomy which will permit the delineation 

The purpose of the proposed work is to extend of information work on the basis of functions 
the knowledge of these data dependencies in three performed, and will facilitate the classification of 


occupations in terms of their information function 
content; and (2) to conduct a survey of the number 
of persons (and their costs) engaged in information 
functions at a professional level. 


#127 


A Survey of Manpower Requirements for Scientific and 
Technical Communication. University of Pittsburgh, \nter- 
disciplinary Department of Information Science, Pittsburgh, 
PA 15260. Anthony Debons. $20,440 for 12 months begin- 
ning April 2, 1979. 


This ts a supplement to the previous continuing 
grant described above. 


#128 


A Survey of Manpower Requirements for Scientific and 
Technical Communication. University of Pittsburgh, Inter- 
disciplinary Department of Information Science, Pittsburgh, 
PA 15260. Anthony Debons. $126,874 for 12 months 
beginning June 20, 1979. 


This is the final supplement to a continuing grant 
described above. 


#129 


Support for U.S. Participation in NATO Advanced Study 
Institute in Information Science. University of Pittsburgh, 
Interdisciplinary Department of Information Science, 
Pittsburgh, PA 15260. Anthony Debons. $15,189 for 12 
months beginning August 23, 1979. 


The Scientific Affairs Division, Science Com- 
mittee, of the North Atlantic Treaty Organization 
has approved a grant for the conduct of the Third 
Advanced Study Institute in Information Science 
to be held in Crete, Greece, August I-11, 1978. The 
Institute will detail the foundations of information 
science as they relate specifically to the design of 
information systems. The purpose of this award ts 
to enable eight U.S. scholars, primarily young 
researchers, to participate. It will also partially 
support preparation of the Institute’s proceedings, 
as well as other documents to ensure wide dissem- 
ination of the results. 


#130 


The Economics of Information Transfer Using Resource Sharing 
Networks— Network Functions. University of Pittsburgh, 
Communications Program, Office of the Director, Pittsburgh, 
PA 15260. Allen Kent. $85,900 for 18 months beginning 
January 10, 1978. 


The objective of this proposal is to obtain a better 
understanding and assessment of the functions of 
library resource sharing networks and to identify 
and define those functions that may be relevant to 
the eventual development of a model of such resource 
sharing networks. 
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This research effort is designed to provide library 
managers and planners wit) information that will 
assist them in determining wheiher it is to their 
advantage to participate in a resource sharing 
network. 


#131 


Economics of Information Transfer Using Resource Sharing 
Networks—Network Modeling Simulation. University of 
Pittsburgh, Office of Research, Sponsored Projects Admin- 
istration, 1028 Cathedral of Learning, Pittsburgh, PA 15260. 
Allen Kent. $233,496 for 30 months beginning August 13, 
1979. 


The objectives of this research are to: {1} construct 
a simulation model capable of accepting data de- 
scribing network characteristics, functions, rates of 
activity, financial resources and policies; (2) produce 
Output data that would provide comparative per- 
formance measures for alternative choices of resource 
sharing networks; and (3) prepare and validate a 
metamodel for resource sharing networks. This 
model would identify the common parameters of 
networks, independent of network type, operations, 
and type of materials. 

This research is designed to provide library and 
information center managers and planners with a 
tool that will assist them in determining whether it 
is to their advantage to participate in a resource 
sharing network. 


#132 


Research on Error-Correcting Techniques in Automated 
Information Processing. Purdue University, School of Electrical 
Engineering, West Lafayette, IN 47907. King-Sun Fu. 
$151,794 for 36 months beginning November 5, 1979. 


This project will develop probabilistic analyses 
lo correct errors in information. Error-correcting 
is important because in practical applications, a 
certain amount of noise or distortion occurs in the 
communication, storage, and retrieval of information 
or in the measurement or processing of physical 
patterns and languages. 

Comprehensive study of such matters requires 
abstract formulation of information pattern struc- 
tures and processes. The proposed work utilizes 
formal languages, a leading example of such an 
abstraction. They express certain aspects of the 
syntax (structure) of both human and computer 
languages and permit the systematic study of proc- 
essing principles, including parsing, that is, the 
determination of whether or not a given sequence 
of words is a possible sentence and, if so, what its 
structure is. 

The proposed approach to recognition of patterns 
and errors in information exploits an analogy 
between the structure of patterns and the syntax of 


language. It thereby seeks to take important results 
already obtained in formal languages and adapt 
them to apply in the domain of pattern recognition. 
The investigator has already had considerable success 
in this area and proposes to broaden the class of 
formal languages involved. 

Specifically, and in technical terms, this project 
will extend previous results on stochastic error- 
correcting parsing for context-free string languages to 
context-sensitive string languages, languages gen- 
erated by attributed grammars, and tree languages. 
Cluster analysis of syntactic information :epresented 


The prescriptive nature of the research will permit 
one to use the results to show how a data provedure 
can increase the effectiveness of information transfer 
by identifying the information flow network, de- 
termining the probability of information flow be- 
tween the network nodes, finding measures of system 
reliability and performing sensitivity analyses to 
identify the critical components of the system. The 
result will be a closer congruence of the objectives 
of the data producer and the requirements of the 
user. 


by strings and trees also will be studied. #136 
#133 Information Processing of Logographic and Alphabetic Writing 
Symbols. Rensselaer Polytechnic Institute, Department of 


Analysis and Application of Information Productivity Psychology, Troy, NY 12181. Yao-chung Tsao. $27,850 
Models. Purdue University, West Lafayette, IN 47907. for 12 months beginning August 6, 1979. 


Ferdinand F. Leimkuhler. $108,099 for 24 months beginning 
January 16, 1980. 


This study is aimed at clarifying the logical 
properties of productivity models, developing sta- 
tistical procedures for analyzing productivity data 
and demonstrating useful applications of these 
models to solving problems associated with the 
design and control of information systems. 

The researchers will make a systematic comparison 
of the various mathematical models that have been 
proposed to describe the productivity or usefulness of 
items of infurmation. The basis for these comparisons 
will be the models’ logical or conceptual differences, 
their usefulness for making statistical inferences 
about productivity characteristics from sample data, 
and their usefulness for improving the planning and 
operation of information systems. 


#134 


Analysis and Application of Information Productivity Models. 
Purdue University, West Lafayette, IN 47907. Ferdinand F. 
Leimkuhler. $1,819 for 0 months beginning June 6, 1980. 


Supplement to NSF Grant IST79-11893 


#135 
A Reliability Theoretic Construct for Document Usage and 


As early as 1961 research indicated that the two 
hemispheres of the human brain function differently. 
It is now generally accepted that the dominant 
hemisphere is more efficient in processing verbal 
material while the non-dominant hemisphere process 
non-verbal (visuo-spatial) material more efficiently. 
This project will study information processing with 
respect to some aspects of hemispheric asymmetry 
by comparing the processing of two different types 
of writing symbols—alphabetic (English) and 
logographic (Chinese). It has been suggested that 
Chinese characters may be processed initially as 
pictorial stimuli (by the non-dominant hemisphere) 
and later, through use and exposure, be processed 
analytically in the dominant hemisphere. 

The general purpose of this project is to investigate 
the differential information processing of Chinese 
and English words in relation to brain hemisphere 
functions. Chinese-English bilinguals will serve as 
subjects and be tested in each language for three 
conditions: interference, reading, and color naming. 
Better understanding of such cognitive processes is 
expected to contribute to the design of more efficient 
man-machine interfaces for computer-based infor- 
mation processing systems. 


Information Flow. Rensselaer Polytechnic Institute, School #137 

of Management, Troy, NY 12181. Pasquale Sullo and William Recognizing Intent in Natural Language. University of 

A. Wallace. $72,800 for 12 months beginning June 22, 1979. Rochester, Computer Science Department, 617 Math Science 
The research is to develop an analytical model, Building, Rochester, NY 14627. James F. Allen. $80,000 


with associated algorithms, of information flow for 24 months beginning August 15, 1980. 


networks that utilizes the probabilistic framework 
of reliability theory. Document usage and infor- 
mation flow networks previously developed will be 
analyzed from a reliability theoretic approach and 
measures of systems reliability will be determined 
using existing computer algorithms. The applicability 
of this approach will be tested by using “typical” 
networks and performing sensitivity analyses. 


The proposed research involves the investigation of 
natural language processing techniques and their 
use in the design of “intelligent” information retrieval 
systems. Proper semantic interpretation 0 
queries often relies on assumptions about the 
speaker's goals. This project will examine the ways 
in wisich these goals are contextually constrained, 
inferentially derived, and ultimately applied to 


understanding processes. A set of plan recognition 
heuristics will be developed, tested and evaluated. 

More specifically, the research aims to develop a 
formalism for recognizing plans as they occur and 
can be analyzed in a dialogue. This formalism, a 
set of plan recognition heuristics, will be implemented 
in a system for simple exchanges, and then extended 
and evaluated in a system of long and more complex 
dialogues. 

It is concerned with parsing real inputs to con- 
versation that may not be syntactically well-formed, 
and devising strategies to efficiently recognize a 
speaker's plan. 


#138 


Services in Industry for the Analysis and Evaluation of 
Information. Rutgers University, New Brunswick, NJ 
07012. Homer J. Hall. $49,624 for 24 months beginning 
August 30, 1978. 


This project is designed to examine the role of 
information analysis in the information transfer cycle 
in industry. Several tasks are identified: {1} define 
and describe information analysis in industry; 
(2) provide a critical review of conditions needed for 
information analysis activities to succeed: (3) make 
statistical correlations based on the attitude of users: 
and (4) develop a guide for the potential user and 
discuss implications for information policy. 


#139 


Design for Evaluating the Centrality of Basic Information 
Science Research to Research Users in Organizations. Uni- 
versity of Southern California, Annenberg School of Com- 
munication, Los Angeles, CA 90007. Thomas H. Martin 
and James A. Danowski. $28,589 for 6 months beginning 
August 23, 1978. 


This six-month award provides for the develop- 
ment of a design to carry out an evaluation of the 
infor,.cation science research program. The approach 
includes: (1) identification of key individuals repre- 
sentative of the information science community; and 
(2) the development of a set of techniques to assess 
the importance of various information science 
research topics of these key individuals. Products 
of the award will include: (1) specification of the 
population of organizations containing users of 
information science research; and (2) preliminary 
instrument design for isolating key individuals and 
for assessing the importance of various information 
science research projects to them. 

Several design studies, in addition to this one, 
are being conducted concurrently. Phase Il awards 
will implement the most promising evaluation designs 
produced. 
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#140 


Economics of Computer Communication Networks. Stanford 
University, Engineering-Economic Systems Department, 
Stanford, CA 94305. Donald A. Dunn. $150,000 for 12 
months beginning August 3, 1978. 


The research program in the economics of 
computer-communication networks and computer- 
based information services will be undertaken. This 
program would continue research carried on during 
the past five years under a predecessor grant. Studies 
will be made of the role of information in the U.S. 
economy, networks as information markets, and 
various aspects of the production and consumption 
of information services. These studies will focus on 
the economics of the information services and 
products that computer networks make possible. 

The major theme of the work under this grant 
will be to continue to develop policy-oriented models 
of the information sector of the economy. Projects 
on the special nature of information as a product, 
on information service markets, on the distribution 
and role of information in non-market economics 
and regulated markets, on the special features of 
international information markets, and on the 
structure and behavior of the information sector 
of the U.S. economy are proposed. 

This ts continuation of work begun under project 
MCS78-07973 AO3 and MCS78-02272. 


#14) 


Economics of Computer Communication Networks. Stanford 
University, Engineering-Economic Systems Department, 
Stanford, CA 94305. Donald A. Dunn. $157,172 for 12 
months beginning July |, 1979. 


Supplement to the award described above. 


#142 


Economics of Computer Communication Networks. Stanford 
University, Engineering-Economic Systems Department, 
Stanford, CA 94305. Donald A. Dunn. $166,458 for 12 
months beginning August |, 1980. 


Supplement to award described above. 


#143 


Theoretical Basis for the Development of Document Preparation 
Systems. Stanford University, Stanford, CA 94305. Donald E. 
Knuth. $338,707 for 24 months beginning January 23, 1980. 


This proposal addresses problems related to the 
generation, representation, and display of the textual 
and graphical information that constitutes a 
“document”. 

Five main topics are considered for research: 
(1) Composition of graphical information (creation 
of figures); (11) Specification of the layout of a doc- 
ument onto two-dimensional media and optimization 


of such a process; (iii) Exploration of unifying models 
of the Document Preparation Process; (iv) Transfer 
of information between user and the system; and 
{v) Some computer-related aspects of Document 
Preparation Systems. 

The proposed methodology emphasizes the de- 
scription of problems, methods, and results using 
an information model common to all such problems 
and independent of particular devices or computer 
systems. The general thrust of the research will be 
to find a satisfactory scientific foundation for 
problems that have only been addressed and/or 
solved for very particular applications. In this way 
the theoretical methods and results as well as the 
prototype implementations that are the expected 
outcome of this proposal will provide a basis for 
much more powerful, general and easy-to-use 
Document Preparation Systems. 


Conceptually-driven (top-down) methods, data- 
driven (bottom-up) methods and hybrid methods 
have been developed to deal with contextual prob- 
lems in text recognition. The objective of this 
proposal is to test and develop top-down and 
bottom-up algorithms for using contextual knowl- 
edge in the automatic recognition of patterns of 
information in text. A process organization that 
will allow the use of diverse sources of knowledge 
will also be studied. 

The proposed experiments will employ bottom-up 
contextual information, such as letter- and work- 
conditional probabilities, and top-down contextual 
information in the form of dictionaries and know}- 
edge about the text source. The aim is to develop a 
dynamic system for interpreting text correctly and 
for automatic correction of errors and identification 
of patterns of information. 


#144 #146 


Developing a Model and Methodology for Evaluating 7rac- The Relational Theory of Databases. State University of New 
titioner Information Systems: The Case of Pacfornet. Stanford York, Computer Science Department, Stony Brook, NY 
University, Institute for Communication Research, Stanford, 11794. David Maier. $89,655 for 24 months beginning 
CA 94305. William Paisley. $65,000 for 12 months beginning August 8, 1979. 

September 21, 1978. 


The proposed research develops an analytical 
framework and methods for measuring the effec- 
tiveness of information systems. The multiple 
methods for evaluation are developed within the 
study through an in-depth assessment of a “prac- 
titioner™ information system, that is, a system set 
up to communicate research findings to practitioners 
within such disciplines as forest research, medicine, 
agriculture, education, etc. 

Results of the study, which uses a variety of survey 
interview and data analysis techniques to assess both 
individuals and organizational responses to the 
system, should result in better understanding of the 
functional relationships between characteristics of 
information systems and the characteristics of the 
individuals and organizations which adopt and use 
them. 

In addition, results to be reported in the infor- 
mation and behavioral science literature should begin 
to overcome the lack of studies in this area and 
provide a conceptual framework powerful enough 
to integrate currently existing narrowly focused 
assessment studies. 


The purpose of this research is to extend the theory 
of the relational data base as well as the model 
itself. The relational model is one of the easiest of 
the major database models for the non-technical 
user to grasp and also enjoys a strong theoretical 
foundation. In a relational database, all data are 
contained in tables where each column is labelled 
with an attribute such as name, age, salary etc. The 
rows of the table which contain the actual values 
for each of the attributes are called tuples. Thus, 
reading across the three columns described above, 
each row would contain a person's name, his age, 
and salary. The tables are called relations and a 
relational database is a set of relations. 

There are certain types of semantic information 
that are not currently incorporated in the model 
and this research will attempt to structure it in a 
format which the database designer can apply. An 
effort will also be made to extend the relational 
model in order to enable relational databases to 
become more self-descriptive. This would permit 
more types of queries to be answered as well as 
making modifications of the database format easier. 


#147 


#145 Montague Grammar and Database. State University of New 
Contextual Algorithms for Text Recognition. State University of York, Department of Computer Science, Long Island, NY 
New York, Department of Computer Science, Buffalo, NY, 11794. David S. Warren. $79,902 for 24 months beginning 
14260. Sargur N. Srihari. $55,800 for 24 months beginning August 6, 1980. 

July 10, 1980. 


Text recognition and processing is an impor- 
tant research area of information system design. 


It is proposed to relate a linguistic theory and a 
database theory, each of them the subject of intense 
current interest in its own field. 


The linguistic theory, Montague grammar, uses 
the principles of mathematical logic to specify syntax 
of a fragment of English, intensional models, and 
an interpretation function from phrases in the syntax 
to objects in the model. A key problem here is to 
specify a model theory complex enough to represent 
the real world as viewed by natural language. 

The database theory, that of relational databases, 
also must represent a part of the real world. The 
representation here should be readily subject to 
various manipulations and interrogations for dif- 
ferent user purposes. 

The research plan has two phases, corresponding 
to two directions of approach. The first begins with a 
Montague fragment of English and a corresponding 
intensional model. That model will make it possible 
to expiore how operations defined for relational 
databases can be used to interpret formulas of the 
intensiona! logic. The second phase will dea! wiih 
the converse problem, starting with a relational 
database and investigating what Montague grammar 
and intentional model that database determines. 
Such issues as the use of the Montague fragment 
as a query langage will then be approachable. 


hundred seventy-five ranking algorithms under 
various environments. The goals are to assess the 
performance-effectiveness of various terms weighting 
schemes and similarity measures and to determine 
the relative costs of implementing and using these 
algorithms. 


#149 


A Study of the Impact of Representatioc:. in Information 
Retrieval Systems. Syracuse University, School of Information 
Studies, 113 Euclid Avenue, Syracuse, NY 13210. Michael J. 
McGill. $155,613 for 24 months beginning February 5, 1980. 


Information systems are structured by the way 
information items are represented; this in turn 
determines the effectiveness and efficiency of the 
systems. This project will study different represen- 
tation schemes to determine the impact of repre- 
sentation on the retrieval characteristics, and suggest 
performance limits for an information system given a 
specific representation. At least seven representations 
will be investigated: (1) title word stems; (2) title- 
abstract word stems; (3) abstract word stems; 
(4) controlled vocabulary; (5) classification codes; 


(6) free indexing terms; and (7) free index stems. 


#148 The study will take place over a period of two 
An Evaluation of Factors Affecting Document Ranking by years and include both a theoretical and an experi- 
Information Retrieval /Systems. Syracuse University, Syracuse, mental component. The experimental study will 
NY 13210. Michael J. McGill. $67,857 for 12 months emphasize the isolation and testing of specific system 
beginning May 9, 1978. components of an information retrieval system. The 


Information systems attempt to supply lists of 
documents which appear to be responsive to the 
user's area of interest. In the case of manual systems it 
is possible to negotiate your search and narrow down 
the documents which you will eventually receive 
by discussions with someone such as a reference 
librarian. But the introduction of computerized 
systems has made the process somewhat less per- 
sonal. One solution has been to rank the output of 
these systems on the criterion of relevance to the 
user's query. Ranked outputs attempt to provide 
the user with information indicating that the closer 
a document is to the beginning of the output list, 
the more likely it is to be relevant to his query. 

A ranking algorithm defines an ordering on a 
set of documents, ordering the documents according 
to their degree of similarity to a query. In the simplest 
case the retrieved set of documents is ordered into 
two classes; those satisfying the retrieval criteria 
and those not satisfying the criteria. More compli- 
cated ranking algorithms may provide orderings 
of greater detail, creating up to N classes of output, 
where N is the number of documents retrieved. 

Many different ranking algorithms have been 
identified, but little empirical work has been done 
to analyze the effectiveness and efficiency of those 
algorithms. This project will examine some three 
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investigator will isolate, control and vary levels of 
variables so that main effects can be studied. The 
theoretical component consists of an iterative process 
for model building and testing, and formalizing 
hypotheses to predict behavior of the information 
systems. 


#150 


Representation, Retrieval, and Reorganization of Conceptual 
Information. University of Texas, Department of Psychology, 
Austin, TX 78712. Jane Perlmutter. $63,952 for 24 months 
beginning July 25, 1979. 


This project will explore alternative ways of 
representing and processing information using 
techniques developed in cognitive psychology and 
in computer science, specifically artificial intelligence. 
It would result in general principles about the 
structure and processing of information, and provide 
formal analyses of baseline information processing 
systems against which the human system can be 
compared. 

A working model of representation, retrieval, and 
reorganization of conceptual knowledge will be 
developed. This model should be consistent with 
psychological constraints and could provide new 
insights into human memory and cognition. 


#151 and improved interaction between the publishers 


Individual User Modeling in Computer Systems. University and users, 
of Texas, Austin, TX 78712. Elaine Rich. $56,315 for 24 This is a continuation of work begun under Grant 
IST77-19308. 


months beginning August 4, 1980. 


The aim of this proposal is to investigate the #153 


elements and structure of information retrieval 
Workshop to Compare and Synthesize Evaluations of Computer- 


system which contain models of individual users. . . : 
The proposed work involves a set of stereotypes in Based Information Exchange. Upsala College, East Orange, 
NJ 07019. Starr Roxanne Hiltz. $43,711 for 6 months 


a hierarchical arrangement and builds on previous _ 
research conducted by the Principal Investigator beginning June 10, 1980. 


to develop a system called “Grundy.” Stereotypes 
are similar to other knowledge structures, such as 
scripts, frames and schemas, and provide a way to 
represent predictable structures of characteristics 
and events. Emphasis on these knowledge structures 
has already resulted in a variety of advances in 
artificial intelligence. The proposed study will extend 
principles of knowledge structures and stereotypes 
to information retrieval systems about food and 
nutrition. 

The project involves developing, storing and 
mapping knowledge representations of users in a 
computerized retrieval system. User traits will be 
modelled as simple linear facets and represented 
on a linear numerical scale. A set of stereotypes 
will be developed to depict users of a generalized 
food and nutrition information system. Conceptual 
domain in the data base will be designed to match 
the interests, nutritional requirements, etc. of various 
stereotypes. 


Investigators of computer-based information 
exchange will hold a three-day workshop to compare 
and synthesize findings which they have obtained 
in twelve separate research projects. Nine of these 
projects were based on EIES (Electronic Information 
Exchange System), a research facility developed 
under NSF sponsorship. The other projects em- 
ployed PLANET, NLS AUGMENT, and a system 
under development at Bell Northern Research, 
respectively. A fifth system, CONFER, and an 
electronic mail system, HERMES, are also familiar 
to some of the participants. The report of this project 
will summarize each of the separate research projects 
and include three synthesis chapters. Most of the 
work of organizing and writing the report will be 
done by means of EIES. The three-day face-to-face 
meeting will supplement the participants’ computer- 
based communication and help them to focus their 
efforts on key aspects of the research under 
consideration. 


#154 


#152 Subgraph Identification Using Associative Techniques with 
Innovations in Engineering Publications. 7yynshare, Inc., 20705 Applications to Information Science and Chemical Structure 
Valley Green Drive, Cupertino, CA 95014. Elizabeth K. Investigation. Wayne State University Department of Electrical 
Michael. $3,695 for 24 months beginning December 15, 1978. and Computer Engineering and Computer Science, 5050 

This project will test an innovative electronic Cass Avenue, Detroit, MI 48202. Dharma P. Agrawal. 


alternative io the traditional paper-based communi- $61.470 for 24 months beginning July 30, 1979. 
cation of scientific and technical information. It 
concentrates on information which loses value over 
time; therefore, rapid dissemination ts very im- 
portant. Engineers in several aerospace organizations 
are provided with facilities for on-line access to a 
data base created from inputs of relevant, “time- 
sensitive” information. Several publishers are par- 
ticipating by converting their regular publication 
material into the electronic form. 

Data on the usefulness of the system will be 
gathered by pre-test and post-test surveys admin- 
istered to the subjects and to a control group; and 
by real time, on-line recording of the engineers’ use 
behavior. 

The tests are designed to validate a set of hypoth- 
eses regarding the time value of information, reduc- 
tion of information overload, increase in usefulness of 
information resources through electronic handling, 
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Information which is either initially displayed in 
graphic format or is subsequently encoded graph- 
ically (such as organic chemical compounds) can 
use subgraph identification for purposes of infor- 
mation analysis and retrieval as well as for other 
pattern recognition purposes. Subgraph identification 
consists of determining whether or not one graph 
appears as a subgraph of another. Thus, subgraph 
identification techniques can be utilized for sub- 
structure searches when dealing with the data base 
of organic chemical compounds. For example. 
substructure searching would permit the retrieval 
of only those compounds which contain a given 
substructure. 

The objective of this proposal is to improve the 
efficiency of subgraph searching techniques. Instead 
of the currently used algorithms this research will 
investigate the effectiveness of using distances 


between nodes in a graph as heuristics to guide the 
search. 


#155 


Lexical Ambiguity Resolution in Discourse. Wayne State 
University, Detroit, MI 48202. Michael K. Tanenhaus. 
$38.977 for 24 months beginning August 25, 1980. 


The widespread existence of words with more 
than one meaning provides an opportunity to 
investigate the processes by which listeners integrate 
an ambiguous word and its context in a sentence 
to determine the appropriate meaning. 

The research will examine the interaction between 
different types of contextual information, lexical 
retrieval, and comprehension processes, and will 
make use of a time delay naming paradigm. Unlike 
previous methodologies used to study lexical 
ambiguity, this procedure is sensitive to different 
Stages in processing as well as their time course. 
The studies are designed to test and elaborate a 
preliminary model of certain aspects of the human 
word storage and retrieval process which is expected 
to aid in the development of advanced speech 
understanding systems. 


#156 


Accuracy of Data on Information Exchange by Electronic 
Network. West Virginia University, Morgantown, WV 
26506. H. Russell Bernard. $39,403 for 12 months beginning 
August 14, 1978. 


Scientists desire to improve the methodology of 
information science research, particularly the meth- 
odology for identifying and studying information 
networks. They also desire to develop measures of 
reliability and validity for network data. This project 
assesses the nature and degree of human error in 
data gathering by sociometric techniques from 
information networks. It will also assess the quality of 
the data that are collected by providing a comparison 
of sociometric survey data with electronically 
recorded logs of social interactions. The research 
will be carried out with the cooperation of volunteers 
from research groups now using the electronic 
information exchange system at the New Jersey 
Institute of Technology. 


#iS7 


A Study of the Distribution, Processing, and Use of Scientific 
and Technical Information in Research Organizations. ( ni- 
versity of Wisconsin, Madison, W1 53706. George P. Huber. 
$91,214 for 12 months beginning January 23, 1980. 


This research is concerned with the distribution, 
processing, and use of information in organizations. 
There are three major objectives of the research: 
(1) develop and test a sei of models of information 
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distribution, processing and use in organizations: 
(2) develop and test transferable procedures for 
mapping information flows in an organization, 
especially decision-related information flows: and 
(3) develop and test transferrable procedures for 
developing and testing diagnostic, predictive mode.s 


of an organization's information system. 


#158 


Distributed Data Base Design. l niversity of Wisconsin, School 
of Business Administration, P. O. Box 413, Milwaukee, WI 
$3201. William D. Haseman. $243.65! for 18 months 
beginning August 4, 1980. 


Because of recent advances in computer tech- 
nology, many large central computer facilities are 
being replaced by clustcis of smaller computers. 
New kinds of softwar: are required for the effective 
use of networks of databases that are geographically 
distributed. 

This project will develop a methodology and 
model for designing distributed databases. The 
modelling will be undertaken as a two-stage process 
involving first the design of the database (local 
model) and then the distribution of that database 
{network model}. The design will be accomplished 
using an integer program which minimizes the sum 
of access and storage costs. The distribution process 
uses a location model for determining the number 
of copies of each file and then their locations. Initially 
these models will be solved sequentially in an 
interactive manner. Since this will only produce a 
local optimum, it ts then planned to use the know!l- 
edge gained to develop a single model which will 
combine both steps into one. Solution techniques 
for this model will be developed and both models 
will be tested using a medical database. 

This project was originally funded as NSF Grant 
IST79-18472 to Carnegie-Mellon Us:-<cisity, and 
was transferred to the University of Wisconsin, 
Milwaukee when the investigator accepted a faculty 
position there. 


#159 


The Critical Review Process: Impact Upon Scientific and 
Technological Information. Worcester Polytechnic Institute, 
Department of Management, Worcester, MA 01609. Arthur 
Gerstenfeld and Richard H. Franke. $147,255 for 24 months 
beginning August 24, 1979. 


The major objective of the project is to establish 
a conceptual and quantitative model which describes 
the process and use of critical reviews in the devel- 
opment of science and technology. This is to be 
accomplished through the analysis of a combination 
of eight hypotheses which treat the functions of 
critical reviews and particular characteristics of 
writing and using them. The research will investigate 


the critical review process in three areas representing 
physical science (biochemistry), social science (social 
science of work and organization), and engineering 
{electric and electronic engineering). Data are to 
be obtained in each area: (1) using forward tracking 
of development through major journals; (2) through 
reference and citation analyses; and (3) using direct 
contact with critical review writers, users, and 
non-users. Analytical methods include graphic 
representation, t-test Comparison, cross-sectional 
analysis, longitudinal analysis using adjustment 
procedures of econometrics, and replication and 
model building employing all of these data and 
techniques. The examining and contrasting of very 
different information experiences are expected to 
yield strong evidence regarding effects of the critical 
review process upon scientific and technological 
information. 


memory structures in a computer program that 
vuoderstands narratives. 


#161 


Theoretical Foundations of Natural Language Processing. } aie 
University, Department of Computer Science, P. O. Box 
2158, Yale Station, New Haven, CT 06520. Roger C. Schank. 
$183,785 for 24 months beginning September 17, 1980. 


The objective of this research is to determine 
whether the fundamental constructs of human 
knowledge representation are invariant across lan- 
guages. There are many practical implications which 
would derive from this finding: one of the most 
obvious being in the area of knowledge-based 
automatic text translation. Unlike previous attempts 
at machine translation, which involved elaborate 
transformations of syntactic structures between each 


#160 pair of languages, text understanding systems could 
Processing Effects M Recall tees be developed based upon an intermediary, language- 
vont ant | Yale University, free, meaning representation. The focus of this 


Department of Computer Science, P. O. Box 2158, Yale 
Station, New Haven, CT 06520. Wendy G. Lehnert, Roger 
C. Schank, Robert P. Abelson. $291,988 for 24 months 


work will be transformations from natural language 
forms to conceptual structures and back to natural 
language. The use of more than one language will 


beginning August 3, 1979. allow a check on the important issue to the extent 
This research in the field of natural language to which the meaning representations are language- 
processing is concerned with the nature of mech- independent. 


anisms which are capable of performing inferences 
similar to those which comprise human reasoning. 
Such mechanisms would permit the development 
of information retrieval systems with response 
capabilities much more attuned to user knowledge 
requirements than can be provided by existing 
sysiems. 

Using techniques of artificial intelligence, this 
project will investigate the ways that processing at 
the ume of understanding can affect recall behavior in 
the tasks of question answering and paraphrase 
production. This investigation will focus on the 
design, and implementation of process-oriented 


The work will utilize two different computational 
understanding systems, one of them already well- 
developed, the other to be further refined. Use of 
the two systems will make it possible to investigate 
two very different aspects of the understanding and 
generation processes. One of the systems ts capable 
of re-expressing complex descriptions of events, 
because it can represent and manipulate sophisti- 
cated aspects of human interaction. The other system 
has a much more detailed database of domain- 
specific knowledge. Both of these aspects must be 
well-developed in order to achieve an adequate 
language-independent meaning representation. 
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